Overview

Dataset statistics

Number of variables145
Number of observations2260668
Missing cells79651908
Missing cells (%)24.3%
Total size in memory2.4 GiB
Average record size in memory1.1 KiB

Variable types

Numeric105
Categorical36
Unsupported3
Boolean1

Alerts

deferral_term has constant value "3.0"Constant
hardship_length has constant value "3.0"Constant
policy_code has constant value "True"Constant
addr_state has a high cardinality: 51 distinct valuesHigh cardinality
debt_settlement_flag_date has a high cardinality: 83 distinct valuesHigh cardinality
desc has a high cardinality: 124486 distinct valuesHigh cardinality
earliest_cr_line has a high cardinality: 755 distinct valuesHigh cardinality
emp_title has a high cardinality: 483752 distinct valuesHigh cardinality
issue_d has a high cardinality: 139 distinct valuesHigh cardinality
last_credit_pull_d has a high cardinality: 141 distinct valuesHigh cardinality
last_pymnt_d has a high cardinality: 136 distinct valuesHigh cardinality
next_pymnt_d has a high cardinality: 106 distinct valuesHigh cardinality
sec_app_earliest_cr_line has a high cardinality: 664 distinct valuesHigh cardinality
settlement_date has a high cardinality: 90 distinct valuesHigh cardinality
title has a high cardinality: 61465 distinct valuesHigh cardinality
zip_code has a high cardinality: 957 distinct valuesHigh cardinality
application_type is highly imbalanced (69.9%)Imbalance
debt_settlement_flag is highly imbalanced (89.0%)Imbalance
debt_settlement_flag_date is highly imbalanced (97.1%)Imbalance
desc is highly imbalanced (90.4%)Imbalance
disbursement_method is highly imbalanced (78.3%)Imbalance
hardship_end_date is highly imbalanced (98.6%)Imbalance
hardship_flag is highly imbalanced (99.5%)Imbalance
hardship_loan_status is highly imbalanced (97.9%)Imbalance
hardship_reason is highly imbalanced (98.3%)Imbalance
hardship_start_date is highly imbalanced (98.6%)Imbalance
hardship_status is highly imbalanced (97.5%)Imbalance
hardship_type is highly imbalanced (95.5%)Imbalance
last_credit_pull_d is highly imbalanced (59.1%)Imbalance
loan_status is highly imbalanced (51.8%)Imbalance
next_pymnt_d is highly imbalanced (85.1%)Imbalance
payment_plan_start_date is highly imbalanced (98.6%)Imbalance
pymnt_plan is highly imbalanced (99.6%)Imbalance
sec_app_earliest_cr_line is highly imbalanced (92.8%)Imbalance
settlement_date is highly imbalanced (97.0%)Imbalance
settlement_status is highly imbalanced (93.2%)Imbalance
title is highly imbalanced (79.3%)Imbalance
verification_status_joint is highly imbalanced (81.6%)Imbalance
acc_open_past_24mths has 50030 (2.2%) missing valuesMissing
all_util has 866348 (38.3%) missing valuesMissing
annual_inc_joint has 2139958 (94.7%) missing valuesMissing
avg_cur_bal has 70346 (3.1%) missing valuesMissing
bc_open_to_buy has 74935 (3.3%) missing valuesMissing
bc_util has 76071 (3.4%) missing valuesMissing
debt_settlement_flag_date has 92352 (4.1%) missing valuesMissing
deferral_term has 2250055 (99.5%) missing valuesMissing
desc has 549383 (24.3%) missing valuesMissing
dti_joint has 2139962 (94.7%) missing valuesMissing
hardship_amount has 2250055 (99.5%) missing valuesMissing
hardship_dpd has 2250055 (99.5%) missing valuesMissing
hardship_end_date has 95321 (4.2%) missing valuesMissing
hardship_last_payment_amount has 2250055 (99.5%) missing valuesMissing
hardship_length has 2250055 (99.5%) missing valuesMissing
hardship_loan_status has 95321 (4.2%) missing valuesMissing
hardship_payoff_balance_amount has 2250055 (99.5%) missing valuesMissing
hardship_reason has 95321 (4.2%) missing valuesMissing
hardship_start_date has 95321 (4.2%) missing valuesMissing
hardship_status has 95321 (4.2%) missing valuesMissing
hardship_type has 95321 (4.2%) missing valuesMissing
id has 2260668 (100.0%) missing valuesMissing
il_util has 1068850 (47.3%) missing valuesMissing
inq_fi has 866129 (38.3%) missing valuesMissing
inq_last_12m has 866130 (38.3%) missing valuesMissing
max_bal_bc has 866129 (38.3%) missing valuesMissing
member_id has 2260668 (100.0%) missing valuesMissing
mo_sin_old_il_acct has 139071 (6.2%) missing valuesMissing
mo_sin_old_rev_tl_op has 70277 (3.1%) missing valuesMissing
mo_sin_rcnt_rev_tl_op has 70277 (3.1%) missing valuesMissing
mo_sin_rcnt_tl has 70276 (3.1%) missing valuesMissing
mort_acc has 50030 (2.2%) missing valuesMissing
mths_since_last_delinq has 1158502 (51.2%) missing valuesMissing
mths_since_last_major_derog has 1679893 (74.3%) missing valuesMissing
mths_since_last_record has 1901512 (84.1%) missing valuesMissing
mths_since_rcnt_il has 909924 (40.3%) missing valuesMissing
mths_since_recent_bc has 73412 (3.2%) missing valuesMissing
mths_since_recent_bc_dlq has 1740967 (77.0%) missing valuesMissing
mths_since_recent_inq has 295435 (13.1%) missing valuesMissing
mths_since_recent_revol_delinq has 1520309 (67.3%) missing valuesMissing
num_accts_ever_120_pd has 70276 (3.1%) missing valuesMissing
num_actv_bc_tl has 70276 (3.1%) missing valuesMissing
num_actv_rev_tl has 70276 (3.1%) missing valuesMissing
num_bc_sats has 58590 (2.6%) missing valuesMissing
num_bc_tl has 70276 (3.1%) missing valuesMissing
num_il_tl has 70276 (3.1%) missing valuesMissing
num_op_rev_tl has 70276 (3.1%) missing valuesMissing
num_rev_accts has 70277 (3.1%) missing valuesMissing
num_rev_tl_bal_gt_0 has 70276 (3.1%) missing valuesMissing
num_sats has 58590 (2.6%) missing valuesMissing
num_tl_120dpd_2m has 153657 (6.8%) missing valuesMissing
num_tl_30dpd has 70276 (3.1%) missing valuesMissing
num_tl_90g_dpd_24m has 70276 (3.1%) missing valuesMissing
num_tl_op_past_12m has 70276 (3.1%) missing valuesMissing
open_acc_6m has 866130 (38.3%) missing valuesMissing
open_act_il has 866129 (38.3%) missing valuesMissing
open_il_12m has 866129 (38.3%) missing valuesMissing
open_il_24m has 866129 (38.3%) missing valuesMissing
open_rv_12m has 866129 (38.3%) missing valuesMissing
open_rv_24m has 866129 (38.3%) missing valuesMissing
orig_projected_additional_accrued_interest has 2252242 (99.6%) missing valuesMissing
payment_plan_start_date has 95321 (4.2%) missing valuesMissing
pct_tl_nvr_dlq has 70431 (3.1%) missing valuesMissing
percent_bc_gt_75 has 75379 (3.3%) missing valuesMissing
revol_bal_joint has 2152648 (95.2%) missing valuesMissing
sec_app_chargeoff_within_12_mths has 2152647 (95.2%) missing valuesMissing
sec_app_collections_12_mths_ex_med has 2152647 (95.2%) missing valuesMissing
sec_app_inq_last_6mths has 2152647 (95.2%) missing valuesMissing
sec_app_mort_acc has 2152647 (95.2%) missing valuesMissing
sec_app_mths_since_last_major_derog has 2224726 (98.4%) missing valuesMissing
sec_app_num_rev_accts has 2152647 (95.2%) missing valuesMissing
sec_app_open_acc has 2152647 (95.2%) missing valuesMissing
sec_app_open_act_il has 2152647 (95.2%) missing valuesMissing
sec_app_revol_util has 2154484 (95.3%) missing valuesMissing
settlement_amount has 2227612 (98.5%) missing valuesMissing
settlement_date has 92352 (4.1%) missing valuesMissing
settlement_percentage has 2227612 (98.5%) missing valuesMissing
settlement_status has 92352 (4.1%) missing valuesMissing
settlement_term has 2227612 (98.5%) missing valuesMissing
tot_coll_amt has 70276 (3.1%) missing valuesMissing
tot_cur_bal has 70276 (3.1%) missing valuesMissing
tot_hi_cred_lim has 70276 (3.1%) missing valuesMissing
total_bal_ex_mort has 50030 (2.2%) missing valuesMissing
total_bal_il has 866129 (38.3%) missing valuesMissing
total_bc_limit has 50030 (2.2%) missing valuesMissing
total_cu_tl has 866130 (38.3%) missing valuesMissing
total_il_high_credit_limit has 70276 (3.1%) missing valuesMissing
total_rev_hi_lim has 70276 (3.1%) missing valuesMissing
url has 2260668 (100.0%) missing valuesMissing
acc_now_delinq is highly skewed (γ1 = 22.90797767)Skewed
annual_inc is highly skewed (γ1 = 493.8860884)Skewed
annual_inc_joint is highly skewed (γ1 = 21.7445355)Skewed
delinq_amnt is highly skewed (γ1 = 102.6547743)Skewed
dti is highly skewed (γ1 = 29.20185447)Skewed
num_tl_120dpd_2m is highly skewed (γ1 = 55.80984712)Skewed
num_tl_30dpd is highly skewed (γ1 = 22.51746312)Skewed
sec_app_chargeoff_within_12_mths is highly skewed (γ1 = 20.27699345)Skewed
tax_liens is highly skewed (γ1 = 32.07091145)Skewed
tot_coll_amt is highly skewed (γ1 = 852.0101323)Skewed
total_rec_late_fee is highly skewed (γ1 = 21.84586707)Skewed
total_rev_hi_lim is highly skewed (γ1 = 32.55742738)Skewed
id is an unsupported type, check if it needs cleaning or further analysisUnsupported
member_id is an unsupported type, check if it needs cleaning or further analysisUnsupported
url is an unsupported type, check if it needs cleaning or further analysisUnsupported
acc_now_delinq has 2251857 (99.6%) zerosZeros
acc_open_past_24mths has 100270 (4.4%) zerosZeros
bc_open_to_buy has 30767 (1.4%) zerosZeros
bc_util has 27885 (1.2%) zerosZeros
chargeoff_within_12_mths has 2243339 (99.2%) zerosZeros
collection_recovery_fee has 2091587 (92.5%) zerosZeros
collections_12_mths_ex_med has 2223085 (98.3%) zerosZeros
delinq_2yrs has 1839108 (81.4%) zerosZeros
delinq_amnt has 2253465 (99.7%) zerosZeros
inq_fi has 697142 (30.8%) zerosZeros
inq_last_12m has 400090 (17.7%) zerosZeros
inq_last_6mths has 1381722 (61.1%) zerosZeros
max_bal_bc has 35917 (1.6%) zerosZeros
mo_sin_rcnt_rev_tl_op has 33747 (1.5%) zerosZeros
mo_sin_rcnt_tl has 34923 (1.5%) zerosZeros
mort_acc has 929606 (41.1%) zerosZeros
mths_since_recent_inq has 168927 (7.5%) zerosZeros
num_accts_ever_120_pd has 1687416 (74.6%) zerosZeros
num_actv_bc_tl has 50061 (2.2%) zerosZeros
num_bc_sats has 23661 (1.0%) zerosZeros
num_il_tl has 68944 (3.0%) zerosZeros
num_tl_120dpd_2m has 2105738 (93.1%) zerosZeros
num_tl_30dpd has 2184561 (96.6%) zerosZeros
num_tl_90g_dpd_24m has 2073060 (91.7%) zerosZeros
num_tl_op_past_12m has 415975 (18.4%) zerosZeros
open_acc_6m has 627966 (27.8%) zerosZeros
open_act_il has 165848 (7.3%) zerosZeros
open_il_12m has 760254 (33.6%) zerosZeros
open_il_24m has 377489 (16.7%) zerosZeros
open_rv_12m has 513716 (22.7%) zerosZeros
open_rv_24m has 223783 (9.9%) zerosZeros
out_prncp has 1312200 (58.0%) zerosZeros
out_prncp_inv has 1312200 (58.0%) zerosZeros
percent_bc_gt_75 has 598711 (26.5%) zerosZeros
pub_rec has 1902758 (84.2%) zerosZeros
pub_rec_bankruptcies has 1987383 (87.9%) zerosZeros
recoveries has 2083167 (92.1%) zerosZeros
sec_app_chargeoff_within_12_mths has 105117 (4.6%) zerosZeros
sec_app_collections_12_mths_ex_med has 101793 (4.5%) zerosZeros
sec_app_inq_last_6mths has 65252 (2.9%) zerosZeros
sec_app_mort_acc has 42218 (1.9%) zerosZeros
tax_liens has 2195933 (97.1%) zerosZeros
tot_coll_amt has 1856129 (82.1%) zerosZeros
total_bal_il has 158666 (7.0%) zerosZeros
total_bc_limit has 25349 (1.1%) zerosZeros
total_cu_tl has 753128 (33.3%) zerosZeros
total_il_high_credit_limit has 263497 (11.7%) zerosZeros
total_rec_late_fee has 2176107 (96.3%) zerosZeros

Reproduction

Analysis started2023-04-17 02:54:08.491947
Analysis finished2023-04-17 02:54:34.182919
Duration25.69 seconds
Software versionpandas-profiling vv3.6.1
Download configurationconfig.json

Variables

acc_now_delinq
Real number (ℝ)

SKEWED  ZEROS 

Distinct9
Distinct (%)< 0.1%
Missing29
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean0.004147942241
Minimum0
Maximum14
Zeros2251857
Zeros (%)99.6%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:34.232142image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum14
Range14
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.06961656266
Coefficient of variation (CV)16.78339731
Kurtosis1256.702032
Mean0.004147942241
Median Absolute Deviation (MAD)0
Skewness22.90797767
Sum9377
Variance0.004846465797
MonotonicityNot monotonic
2023-04-16T23:54:34.279590image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
0 2251857
99.6%
1 8293
 
0.4%
2 421
 
< 0.1%
3 50
 
< 0.1%
4 11
 
< 0.1%
5 3
 
< 0.1%
6 2
 
< 0.1%
14 1
 
< 0.1%
7 1
 
< 0.1%
(Missing) 29
 
< 0.1%
ValueCountFrequency (%)
0 2251857
99.6%
1 8293
 
0.4%
2 421
 
< 0.1%
3 50
 
< 0.1%
4 11
 
< 0.1%
ValueCountFrequency (%)
14 1
 
< 0.1%
7 1
 
< 0.1%
6 2
 
< 0.1%
5 3
 
< 0.1%
4 11
< 0.1%

acc_open_past_24mths
Real number (ℝ)

MISSING  ZEROS 

Distinct57
Distinct (%)< 0.1%
Missing50030
Missing (%)2.2%
Infinite0
Infinite (%)0.0%
Mean4.521655739
Minimum0
Maximum64
Zeros100270
Zeros (%)4.4%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:34.374179image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q12
median4
Q36
95-th percentile10
Maximum64
Range64
Interquartile range (IQR)4

Descriptive statistics

Standard deviation3.164229436
Coefficient of variation (CV)0.6997944157
Kurtosis4.374804415
Mean4.521655739
Median Absolute Deviation (MAD)2
Skewness1.402234451
Sum9995744
Variance10.01234792
MonotonicityNot monotonic
2023-04-16T23:54:34.466482image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3 331293
14.7%
4 307313
13.6%
2 305502
13.5%
5 258117
11.4%
1 222952
9.9%
6 201931
8.9%
7 149447
6.6%
8 106194
 
4.7%
0 100270
 
4.4%
9 73503
 
3.3%
Other values (47) 154116
6.8%
ValueCountFrequency (%)
0 100270
 
4.4%
1 222952
9.9%
2 305502
13.5%
3 331293
14.7%
4 307313
13.6%
ValueCountFrequency (%)
64 1
 
< 0.1%
61 1
 
< 0.1%
56 1
 
< 0.1%
55 1
 
< 0.1%
54 3
< 0.1%

addr_state
Categorical

Distinct51
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.2 MiB
CA
314533 
NY
186389 
TX
186335 
FL
161991 
IL
 
91173
Other values (46)
1320247 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNY
2nd rowLA
3rd rowMI
4th rowWA
5th rowMD

Common Values

ValueCountFrequency (%)
CA 314533
 
13.9%
NY 186389
 
8.2%
TX 186335
 
8.2%
FL 161991
 
7.2%
IL 91173
 
4.0%
NJ 83132
 
3.7%
PA 76939
 
3.4%
OH 75132
 
3.3%
GA 74196
 
3.3%
VA 62954
 
2.8%
Other values (41) 947894
41.9%

all_util
Real number (ℝ)

Distinct188
Distinct (%)< 0.1%
Missing866348
Missing (%)38.3%
Infinite0
Infinite (%)0.0%
Mean57.03229531
Minimum0
Maximum239
Zeros2873
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:34.560666image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile21
Q143
median58
Q372
95-th percentile90
Maximum239
Range239
Interquartile range (IQR)29

Descriptive statistics

Standard deviation20.9047476
Coefficient of variation (CV)0.3665422807
Kurtosis-0.07732273667
Mean57.03229531
Median Absolute Deviation (MAD)14
Skewness-0.1233140004
Sum79521270
Variance437.0084721
MonotonicityNot monotonic
2023-04-16T23:54:34.655644image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
59 26677
 
1.2%
60 26674
 
1.2%
61 26475
 
1.2%
62 26330
 
1.2%
63 26295
 
1.2%
58 26105
 
1.2%
64 26010
 
1.2%
57 25934
 
1.1%
65 25763
 
1.1%
55 25429
 
1.1%
Other values (178) 1132628
50.1%
(Missing) 866348
38.3%
ValueCountFrequency (%)
0 2873
0.1%
1 1720
0.1%
2 1594
0.1%
3 1676
0.1%
4 1706
0.1%
ValueCountFrequency (%)
239 1
< 0.1%
211 1
< 0.1%
210 1
< 0.1%
204 1
< 0.1%
198 1
< 0.1%

annual_inc
Real number (ℝ)

Distinct89368
Distinct (%)4.0%
Missing4
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean77992.42869
Minimum0
Maximum110000000
Zeros1667
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:35.109994image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile27600
Q146000
median65000
Q393000
95-th percentile160000
Maximum110000000
Range110000000
Interquartile range (IQR)47000

Descriptive statistics

Standard deviation112696.1996
Coefficient of variation (CV)1.444963331
Kurtosis439001.6589
Mean77992.42869
Median Absolute Deviation (MAD)22000
Skewness493.8860884
Sum1.763146758 × 1011
Variance1.27004334 × 1010
MonotonicityNot monotonic
2023-04-16T23:54:35.210441image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
60000 87189
 
3.9%
50000 76355
 
3.4%
65000 64903
 
2.9%
70000 62078
 
2.7%
80000 59833
 
2.6%
40000 59684
 
2.6%
75000 58459
 
2.6%
45000 54534
 
2.4%
55000 51583
 
2.3%
100000 46977
 
2.1%
Other values (89358) 1639069
72.5%
ValueCountFrequency (%)
0 1667
0.1%
0.36 1
 
< 0.1%
1 42
 
< 0.1%
2 1
 
< 0.1%
3 1
 
< 0.1%
ValueCountFrequency (%)
110000000 1
< 0.1%
61000000 1
< 0.1%
10999200 1
< 0.1%
9930475 1
< 0.1%
9757200 1
< 0.1%

annual_inc_joint
Real number (ℝ)

MISSING  SKEWED 

Distinct17633
Distinct (%)14.6%
Missing2139958
Missing (%)94.7%
Infinite0
Infinite (%)0.0%
Mean123624.6367
Minimum5693.51
Maximum7874821
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:35.306519image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum5693.51
5-th percentile53310.093
Q183400
median110000
Q3147995
95-th percentile230975
Maximum7874821
Range7869127.49
Interquartile range (IQR)64595

Descriptive statistics

Standard deviation74161.34633
Coefficient of variation (CV)0.5998913186
Kurtosis1741.838783
Mean123624.6367
Median Absolute Deviation (MAD)30000
Skewness21.7445355
Sum1.49227299 × 1010
Variance5499905289
MonotonicityNot monotonic
2023-04-16T23:54:35.401343image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
100000 2150
 
0.1%
120000 2057
 
0.1%
110000 2053
 
0.1%
90000 1874
 
0.1%
130000 1788
 
0.1%
80000 1650
 
0.1%
105000 1550
 
0.1%
140000 1529
 
0.1%
150000 1460
 
0.1%
115000 1460
 
0.1%
Other values (17623) 103139
 
4.6%
(Missing) 2139958
94.7%
ValueCountFrequency (%)
5693.51 1
< 0.1%
9000 1
< 0.1%
11000 1
< 0.1%
12500 1
< 0.1%
13464 1
< 0.1%
ValueCountFrequency (%)
7874821 1
< 0.1%
6282000 1
< 0.1%
5653500 1
< 0.1%
4200000 1
< 0.1%
2000000 1
< 0.1%

application_type
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.2 MiB
Individual
2139958 
Joint App
 
120710

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowIndividual
2nd rowIndividual
3rd rowIndividual
4th rowIndividual
5th rowIndividual

Common Values

ValueCountFrequency (%)
Individual 2139958
94.7%
Joint App 120710
 
5.3%

Common Values (Plot)

2023-04-16T23:54:35.493806image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

avg_cur_bal
Real number (ℝ)

Distinct88597
Distinct (%)4.0%
Missing70346
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean13547.79751
Minimum0
Maximum958084
Zeros906
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:35.555596image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1072
Q13080
median7335
Q318783
95-th percentile43578
Maximum958084
Range958084
Interquartile range (IQR)15703

Descriptive statistics

Standard deviation16474.07501
Coefficient of variation (CV)1.215996548
Kurtosis44.16460012
Mean13547.79751
Median Absolute Deviation (MAD)5369
Skewness3.868891119
Sum2.967403894 × 1010
Variance271395147.4
MonotonicityNot monotonic
2023-04-16T23:54:35.669910image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 906
 
< 0.1%
2442 273
 
< 0.1%
2277 271
 
< 0.1%
1971 268
 
< 0.1%
2522 267
 
< 0.1%
1967 267
 
< 0.1%
2025 266
 
< 0.1%
2148 266
 
< 0.1%
2831 265
 
< 0.1%
2286 265
 
< 0.1%
Other values (88587) 2187008
96.7%
(Missing) 70346
 
3.1%
ValueCountFrequency (%)
0 906
< 0.1%
1 67
 
< 0.1%
2 59
 
< 0.1%
3 50
 
< 0.1%
4 38
 
< 0.1%
ValueCountFrequency (%)
958084 1
< 0.1%
800008 1
< 0.1%
752994 1
< 0.1%
710392 1
< 0.1%
646339 1
< 0.1%

bc_open_to_buy
Real number (ℝ)

MISSING  ZEROS 

Distinct91500
Distinct (%)4.2%
Missing74935
Missing (%)3.3%
Infinite0
Infinite (%)0.0%
Mean11394.26269
Minimum0
Maximum711140
Zeros30767
Zeros (%)1.4%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:35.761095image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile162
Q11722
median5442
Q314187
95-th percentile42800
Maximum711140
Range711140
Interquartile range (IQR)12465

Descriptive statistics

Standard deviation16599.5344
Coefficient of variation (CV)1.456832693
Kurtosis27.09912245
Mean11394.26269
Median Absolute Deviation (MAD)4567
Skewness3.737088307
Sum2.490481597 × 1010
Variance275544542.3
MonotonicityNot monotonic
2023-04-16T23:54:35.856156image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 30767
 
1.4%
500 1954
 
0.1%
2000 1908
 
0.1%
1000 1709
 
0.1%
3000 1580
 
0.1%
2500 1395
 
0.1%
5000 1383
 
0.1%
1500 1300
 
0.1%
4000 1234
 
0.1%
3500 1160
 
0.1%
Other values (91490) 2141343
94.7%
(Missing) 74935
 
3.3%
ValueCountFrequency (%)
0 30767
1.4%
1 342
 
< 0.1%
2 354
 
< 0.1%
3 358
 
< 0.1%
4 384
 
< 0.1%
ValueCountFrequency (%)
711140 1
< 0.1%
605996 1
< 0.1%
559912 1
< 0.1%
507259 1
< 0.1%
497445 1
< 0.1%

bc_util
Real number (ℝ)

MISSING  ZEROS 

Distinct1494
Distinct (%)0.1%
Missing76071
Missing (%)3.4%
Infinite0
Infinite (%)0.0%
Mean57.89994754
Minimum0
Maximum339.6
Zeros27885
Zeros (%)1.2%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:35.950385image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile7.7
Q135.4
median60.2
Q383.1
95-th percentile97.8
Maximum339.6
Range339.6
Interquartile range (IQR)47.7

Descriptive statistics

Standard deviation28.58347454
Coefficient of variation (CV)0.4936701284
Kurtosis-1.001197473
Mean57.89994754
Median Absolute Deviation (MAD)23.8
Skewness-0.2697763948
Sum126488051.7
Variance817.0150167
MonotonicityNot monotonic
2023-04-16T23:54:36.041708image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 27885
 
1.2%
98 6188
 
0.3%
97 5748
 
0.3%
99 5700
 
0.3%
96 5642
 
0.2%
95 5223
 
0.2%
94 4983
 
0.2%
93 4693
 
0.2%
92 4594
 
0.2%
91 4381
 
0.2%
Other values (1484) 2109560
93.3%
(Missing) 76071
 
3.4%
ValueCountFrequency (%)
0 27885
1.2%
0.1 2267
 
0.1%
0.2 1959
 
0.1%
0.3 1680
 
0.1%
0.4 1440
 
0.1%
ValueCountFrequency (%)
339.6 1
< 0.1%
318.2 1
< 0.1%
255.2 1
< 0.1%
252.3 1
< 0.1%
243.8 1
< 0.1%

chargeoff_within_12_mths
Real number (ℝ)

Distinct11
Distinct (%)< 0.1%
Missing145
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean0.008464412881
Minimum0
Maximum10
Zeros2243339
Zeros (%)99.2%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:36.120311image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum10
Range10
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.1048097892
Coefficient of variation (CV)12.38240509
Kurtosis598.1324648
Mean0.008464412881
Median Absolute Deviation (MAD)0
Skewness18.12854845
Sum19134
Variance0.01098509191
MonotonicityNot monotonic
2023-04-16T23:54:36.198467image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
0 2243339
99.2%
1 15765
 
0.7%
2 1115
 
< 0.1%
3 186
 
< 0.1%
4 68
 
< 0.1%
5 22
 
< 0.1%
6 12
 
< 0.1%
7 8
 
< 0.1%
9 5
 
< 0.1%
8 2
 
< 0.1%
(Missing) 145
 
< 0.1%
ValueCountFrequency (%)
0 2243339
99.2%
1 15765
 
0.7%
2 1115
 
< 0.1%
3 186
 
< 0.1%
4 68
 
< 0.1%
ValueCountFrequency (%)
10 1
 
< 0.1%
9 5
< 0.1%
8 2
 
< 0.1%
7 8
< 0.1%
6 12
< 0.1%

collection_recovery_fee
Real number (ℝ)

Distinct140449
Distinct (%)6.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean22.5932837
Minimum0
Maximum7174.719
Zeros2091587
Zeros (%)92.5%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:36.276528image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile117.82377
Maximum7174.719
Range7174.719
Interquartile range (IQR)0

Descriptive statistics

Standard deviation127.1113615
Coefficient of variation (CV)5.62606849
Kurtosis245.326771
Mean22.5932837
Median Absolute Deviation (MAD)0
Skewness11.80869926
Sum51075913.48
Variance16157.29823
MonotonicityNot monotonic
2023-04-16T23:54:36.370743image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 2091587
92.5%
18 633
 
< 0.1%
9 592
 
< 0.1%
27 475
 
< 0.1%
36 368
 
< 0.1%
4.5 260
 
< 0.1%
45 224
 
< 0.1%
54 219
 
< 0.1%
13.5 218
 
< 0.1%
72 165
 
< 0.1%
Other values (140439) 165927
 
7.3%
ValueCountFrequency (%)
0 2091587
92.5%
0.018 1
 
< 0.1%
0.036 1
 
< 0.1%
0.0378 1
 
< 0.1%
0.0449999999 1
 
< 0.1%
ValueCountFrequency (%)
7174.719 1
< 0.1%
7002.19 1
< 0.1%
6972.59 1
< 0.1%
6687.6228 1
< 0.1%
6584.1372 1
< 0.1%
Distinct16
Distinct (%)< 0.1%
Missing145
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean0.01814580077
Minimum0
Maximum20
Zeros2223085
Zeros (%)98.3%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:36.450347image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum20
Range20
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.1508131422
Coefficient of variation (CV)8.311186928
Kurtosis607.1260844
Mean0.01814580077
Median Absolute Deviation (MAD)0
Skewness14.03257434
Sum41019
Variance0.02274460386
MonotonicityNot monotonic
2023-04-16T23:54:36.515213image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
0 2223085
98.3%
1 34684
 
1.5%
2 2313
 
0.1%
3 271
 
< 0.1%
4 93
 
< 0.1%
5 36
 
< 0.1%
6 17
 
< 0.1%
7 7
 
< 0.1%
8 4
 
< 0.1%
9 4
 
< 0.1%
Other values (6) 9
 
< 0.1%
(Missing) 145
 
< 0.1%
ValueCountFrequency (%)
0 2223085
98.3%
1 34684
 
1.5%
2 2313
 
0.1%
3 271
 
< 0.1%
4 93
 
< 0.1%
ValueCountFrequency (%)
20 2
< 0.1%
16 1
< 0.1%
14 1
< 0.1%
12 2
< 0.1%
11 1
< 0.1%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.2 MiB
N
2227612 
Y
 
33056

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowN
2nd rowN
3rd rowN
4th rowN
5th rowN

Common Values

ValueCountFrequency (%)
N 2227612
98.5%
Y 33056
 
1.5%

Common Values (Plot)

2023-04-16T23:54:36.608353image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

debt_settlement_flag_date
Categorical

HIGH CARDINALITY  IMBALANCE  MISSING 

Distinct83
Distinct (%)< 0.1%
Missing92352
Missing (%)4.1%
Memory size17.2 MiB
2135260 
Feb-2019
 
2730
Jan-2019
 
2617
Oct-2018
 
2426
Dec-2018
 
2317
Other values (78)
 
22966

Unique

Unique9 ?
Unique (%)< 0.1%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
2135260
94.5%
Feb-2019 2730
 
0.1%
Jan-2019 2617
 
0.1%
Oct-2018 2426
 
0.1%
Dec-2018 2317
 
0.1%
Nov-2018 2277
 
0.1%
Aug-2018 2064
 
0.1%
Jun-2018 1921
 
0.1%
Jul-2018 1608
 
0.1%
Sep-2018 1592
 
0.1%
Other values (73) 13504
 
0.6%
(Missing) 92352
 
4.1%

deferral_term
Real number (ℝ)

CONSTANT  MISSING 

Distinct1
Distinct (%)< 0.1%
Missing2250055
Missing (%)99.5%
Infinite0
Infinite (%)0.0%
Mean3
Minimum3
Maximum3
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:36.661202image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum3
5-th percentile3
Q13
median3
Q33
95-th percentile3
Maximum3
Range0
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0
Coefficient of variation (CV)0
Kurtosis0
Mean3
Median Absolute Deviation (MAD)0
Skewness0
Sum31839
Variance0
MonotonicityIncreasing
2023-04-16T23:54:36.717857image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=1)
ValueCountFrequency (%)
3 10613
 
0.5%
(Missing) 2250055
99.5%
ValueCountFrequency (%)
3 10613
0.5%
ValueCountFrequency (%)
3 10613
0.5%

delinq_2yrs
Real number (ℝ)

Distinct37
Distinct (%)< 0.1%
Missing29
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean0.3068791612
Minimum0
Maximum58
Zeros1839108
Zeros (%)81.4%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:36.796507image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile2
Maximum58
Range58
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.8672303329
Coefficient of variation (CV)2.825966839
Kurtosis73.35208962
Mean0.3068791612
Median Absolute Deviation (MAD)0
Skewness5.929811375
Sum693743
Variance0.7520884503
MonotonicityNot monotonic
2023-04-16T23:54:36.876747image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=37)
ValueCountFrequency (%)
0 1839108
81.4%
1 281353
 
12.4%
2 81289
 
3.6%
3 29542
 
1.3%
4 13179
 
0.6%
5 6599
 
0.3%
6 3717
 
0.2%
7 2062
 
0.1%
8 1223
 
0.1%
9 818
 
< 0.1%
Other values (27) 1749
 
0.1%
ValueCountFrequency (%)
0 1839108
81.4%
1 281353
 
12.4%
2 81289
 
3.6%
3 29542
 
1.3%
4 13179
 
0.6%
ValueCountFrequency (%)
58 1
< 0.1%
42 1
< 0.1%
39 1
< 0.1%
36 1
< 0.1%
35 1
< 0.1%

delinq_amnt
Real number (ℝ)

SKEWED  ZEROS 

Distinct2617
Distinct (%)0.1%
Missing29
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean12.36982773
Minimum0
Maximum249925
Zeros2253465
Zeros (%)99.7%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:36.963632image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum249925
Range249925
Interquartile range (IQR)0

Descriptive statistics

Standard deviation726.4647813
Coefficient of variation (CV)58.72877108
Kurtosis16006.0037
Mean12.36982773
Median Absolute Deviation (MAD)0
Skewness102.6547743
Sum27963715
Variance527751.0785
MonotonicityNot monotonic
2023-04-16T23:54:37.043038image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 2253465
99.7%
25 124
 
< 0.1%
65000 109
 
< 0.1%
30 85
 
< 0.1%
53 72
 
< 0.1%
54 69
 
< 0.1%
75 65
 
< 0.1%
50 64
 
< 0.1%
56 58
 
< 0.1%
57 56
 
< 0.1%
Other values (2607) 6472
 
0.3%
ValueCountFrequency (%)
0 2253465
99.7%
1 9
 
< 0.1%
2 10
 
< 0.1%
3 12
 
< 0.1%
4 15
 
< 0.1%
ValueCountFrequency (%)
249925 1
< 0.1%
185408 1
< 0.1%
159177 1
< 0.1%
138474 1
< 0.1%
130778 1
< 0.1%

desc
Categorical

HIGH CARDINALITY  IMBALANCE  MISSING 

Distinct124486
Distinct (%)7.3%
Missing549383
Missing (%)24.3%
Memory size17.2 MiB
1585470 
Debt Consolidation
 
13
Borrower added on 03/17/14 > Debt consolidation<br>
 
11
Borrower added on 03/10/14 > Debt consolidation<br>
 
10
Borrower added on 02/19/14 > Debt consolidation<br>
 
9
Other values (124481)
 
125772

Unique

Unique123638 ?
Unique (%)7.2%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
1585470
70.1%
Debt Consolidation 13
 
< 0.1%
Borrower added on 03/17/14 > Debt consolidation<br> 11
 
< 0.1%
Borrower added on 03/10/14 > Debt consolidation<br> 10
 
< 0.1%
Borrower added on 02/19/14 > Debt consolidation<br> 9
 
< 0.1%
Camping Membership 8
 
< 0.1%
Borrower added on 01/29/14 > Debt consolidation<br> 8
 
< 0.1%
Borrower added on 01/22/14 > Debt consolidation<br> 7
 
< 0.1%
Borrower added on 01/15/14 > Debt consolidation<br> 7
 
< 0.1%
Borrower added on 01/03/14 > Debt consolidation<br> 6
 
< 0.1%
Other values (124476) 125736
 
5.6%
(Missing) 549383
 
24.3%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.2 MiB
Cash
2182546 
DirectPay
 
78122

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowCash
2nd rowCash
3rd rowCash
4th rowCash
5th rowCash

Common Values

ValueCountFrequency (%)
Cash 2182546
96.5%
DirectPay 78122
 
3.5%

Common Values (Plot)

2023-04-16T23:54:37.134195image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

dti
Real number (ℝ)

Distinct10845
Distinct (%)0.5%
Missing1711
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean18.82419644
Minimum-1
Maximum999
Zeros1732
Zeros (%)0.1%
Negative2
Negative (%)< 0.1%
Memory size17.2 MiB
2023-04-16T23:54:37.215208image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum-1
5-th percentile4.94
Q111.89
median17.84
Q324.49
95-th percentile33.88
Maximum999
Range1000
Interquartile range (IQR)12.6

Descriptive statistics

Standard deviation14.18332854
Coefficient of variation (CV)0.7534626294
Kurtosis1755.261278
Mean18.82419644
Median Absolute Deviation (MAD)6.27
Skewness29.20185447
Sum42523050.31
Variance201.1668086
MonotonicityNot monotonic
2023-04-16T23:54:37.310732image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1732
 
0.1%
18 1584
 
0.1%
14.4 1577
 
0.1%
16.8 1576
 
0.1%
19.2 1566
 
0.1%
15.6 1506
 
0.1%
13.2 1496
 
0.1%
12 1486
 
0.1%
20.4 1424
 
0.1%
21.6 1391
 
0.1%
Other values (10835) 2243619
99.2%
(Missing) 1711
 
0.1%
ValueCountFrequency (%)
-1 2
 
< 0.1%
0 1732
0.1%
0.01 22
 
< 0.1%
0.02 35
 
< 0.1%
0.03 19
 
< 0.1%
ValueCountFrequency (%)
999 135
< 0.1%
995.6 1
 
< 0.1%
995.17 1
 
< 0.1%
994.4 1
 
< 0.1%
991.57 1
 
< 0.1%

dti_joint
Real number (ℝ)

Distinct4018
Distinct (%)3.3%
Missing2139962
Missing (%)94.7%
Infinite0
Infinite (%)0.0%
Mean19.25181706
Minimum0
Maximum69.49
Zeros18
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:37.406921image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile6.99
Q113.53
median18.84
Q324.62
95-th percentile33.0275
Maximum69.49
Range69.49
Interquartile range (IQR)11.09

Descriptive statistics

Standard deviation7.82208598
Coefficient of variation (CV)0.4063037767
Kurtosis-0.3840496448
Mean19.25181706
Median Absolute Deviation (MAD)5.54
Skewness0.2205511496
Sum2323809.83
Variance61.18502907
MonotonicityNot monotonic
2023-04-16T23:54:37.485976image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
17.32 80
 
< 0.1%
19.89 77
 
< 0.1%
18.1 77
 
< 0.1%
22.39 77
 
< 0.1%
17.96 76
 
< 0.1%
13.98 76
 
< 0.1%
18.96 76
 
< 0.1%
17.02 75
 
< 0.1%
16.97 75
 
< 0.1%
20.45 75
 
< 0.1%
Other values (4008) 119942
 
5.3%
(Missing) 2139962
94.7%
ValueCountFrequency (%)
0 18
< 0.1%
0.03 1
 
< 0.1%
0.11 1
 
< 0.1%
0.12 1
 
< 0.1%
0.13 2
 
< 0.1%
ValueCountFrequency (%)
69.49 1
< 0.1%
63.66 1
< 0.1%
61.9 1
< 0.1%
61.28 1
< 0.1%
55.52 1
< 0.1%

earliest_cr_line
Categorical

Distinct755
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.2 MiB
Sep-2004
 
15400
Sep-2003
 
15215
Sep-2005
 
14780
Aug-2003
 
14669
Aug-2004
 
14413
Other values (750)
2186191 

Unique

Unique34 ?
Unique (%)< 0.1%

Sample

1st rowApr-2001
2nd rowJun-1987
3rd rowApr-2011
4th rowFeb-2006
5th rowDec-2000

Common Values

ValueCountFrequency (%)
Sep-2004 15400
 
0.7%
Sep-2003 15215
 
0.7%
Sep-2005 14780
 
0.7%
Aug-2003 14669
 
0.6%
Aug-2004 14413
 
0.6%
Aug-2001 14355
 
0.6%
Aug-2002 14322
 
0.6%
Aug-2005 14207
 
0.6%
Aug-2006 14143
 
0.6%
Oct-2003 14108
 
0.6%
Other values (745) 2115056
93.6%

emp_length
Categorical

Distinct12
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.2 MiB
10+ years
748005 
2 years
203677 
< 1 year
189988 
3 years
180753 
1 year
148403 
Other values (7)
789842 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row10+ years
2nd row10+ years
3rd row6 years
4th row10+ years
5th row10+ years

Common Values

ValueCountFrequency (%)
10+ years 748005
33.1%
2 years 203677
 
9.0%
< 1 year 189988
 
8.4%
3 years 180753
 
8.0%
1 year 148403
 
6.6%
n/a 146907
 
6.5%
5 years 139698
 
6.2%
4 years 136605
 
6.0%
6 years 102628
 
4.5%
7 years 92695
 
4.1%
Other values (2) 171309
 
7.6%

emp_title
Categorical

Distinct483752
Distinct (%)21.4%
Missing10
Missing (%)< 0.1%
Memory size17.2 MiB
 
166934
Teacher
 
40969
Manager
 
37014
Owner
 
23556
Supervisor
 
17686
Other values (483747)
1974499 

Unique

Unique364706 ?
Unique (%)16.1%

Sample

1st rowChef
2nd rowPostmaster
3rd rowAdministrative
4th rowIT Supervisor
5th rowMechanic

Common Values

ValueCountFrequency (%)
166934
 
7.4%
Teacher 40969
 
1.8%
Manager 37014
 
1.6%
Owner 23556
 
1.0%
Supervisor 17686
 
0.8%
Registered Nurse 17027
 
0.8%
Driver 16038
 
0.7%
RN 15202
 
0.7%
Sales 14173
 
0.6%
Project Manager 11545
 
0.5%
Other values (483742) 1900514
84.1%

funded_amnt
Real number (ℝ)

Distinct1572
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15041.66406
Minimum500
Maximum40000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.6 MiB
2023-04-16T23:54:37.599011image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum500
5-th percentile3250
Q18000
median12875
Q320000
95-th percentile35000
Maximum40000
Range39500
Interquartile range (IQR)12000

Descriptive statistics

Standard deviation9188.413022
Coefficient of variation (CV)0.6108641296
Kurtosis-0.1170090387
Mean15041.66406
Median Absolute Deviation (MAD)6175
Skewness0.7787785936
Sum3.40042086 × 1010
Variance84426933.87
MonotonicityNot monotonic
2023-04-16T23:54:37.705767image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10000 187146
 
8.3%
20000 130816
 
5.8%
15000 123110
 
5.4%
12000 121588
 
5.4%
35000 86147
 
3.8%
5000 84751
 
3.7%
8000 75020
 
3.3%
6000 72075
 
3.2%
16000 66331
 
2.9%
25000 66176
 
2.9%
Other values (1562) 1247508
55.2%
ValueCountFrequency (%)
500 11
< 0.1%
550 1
 
< 0.1%
600 6
< 0.1%
700 3
 
< 0.1%
725 1
 
< 0.1%
ValueCountFrequency (%)
40000 33368
1.5%
39975 11
 
< 0.1%
39950 10
 
< 0.1%
39925 14
 
< 0.1%
39900 24
 
< 0.1%

funded_amnt_inv
Real number (ℝ)

Distinct10057
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15023.43762
Minimum0
Maximum40000
Zeros233
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:37.812797image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile3200
Q18000
median12800
Q320000
95-th percentile35000
Maximum40000
Range40000
Interquartile range (IQR)12000

Descriptive statistics

Standard deviation9192.331807
Coefficient of variation (CV)0.6118660747
Kurtosis-0.1166815151
Mean15023.43762
Median Absolute Deviation (MAD)6200
Skewness0.7782542385
Sum3.396300469 × 1010
Variance84498964.04
MonotonicityNot monotonic
2023-04-16T23:54:37.896408image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10000 177561
 
7.9%
20000 120453
 
5.3%
15000 114539
 
5.1%
12000 114068
 
5.0%
5000 81999
 
3.6%
35000 76093
 
3.4%
8000 71528
 
3.2%
6000 69475
 
3.1%
16000 61840
 
2.7%
25000 60610
 
2.7%
Other values (10047) 1312502
58.1%
ValueCountFrequency (%)
0 233
< 0.1%
0.000121098108 1
 
< 0.1%
0.000185369401 1
 
< 0.1%
0.000242055511 1
 
< 0.1%
0.000531133069 1
 
< 0.1%
ValueCountFrequency (%)
40000 31767
1.4%
39975 616
 
< 0.1%
39950 218
 
< 0.1%
39925 58
 
< 0.1%
39900 29
 
< 0.1%

grade
Categorical

Distinct7
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.2 MiB
B
663557 
C
650053 
A
433027 
D
324424 
E
135639 
Other values (2)
 
53968

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowC
2nd rowD
3rd rowD
4th rowD
5th rowC

Common Values

ValueCountFrequency (%)
B 663557
29.4%
C 650053
28.8%
A 433027
19.2%
D 324424
14.4%
E 135639
 
6.0%
F 41800
 
1.8%
G 12168
 
0.5%

Common Values (Plot)

2023-04-16T23:54:38.003513image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

hardship_amount
Real number (ℝ)

Distinct8950
Distinct (%)84.3%
Missing2250055
Missing (%)99.5%
Infinite0
Infinite (%)0.0%
Mean155.0066956
Minimum0.64
Maximum943.94
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:38.082350image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0.64
5-th percentile21.134
Q159.37
median119.04
Q3213.26
95-th percentile410.228
Maximum943.94
Range943.3
Interquartile range (IQR)153.89

Descriptive statistics

Standard deviation129.1131374
Coefficient of variation (CV)0.8329520019
Kurtosis3.302815903
Mean155.0066956
Median Absolute Deviation (MAD)69.06
Skewness1.606076862
Sum1645086.06
Variance16670.20224
MonotonicityNot monotonic
2023-04-16T23:54:38.167560image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
69.9 5
 
< 0.1%
94.59 5
 
< 0.1%
53.05 5
 
< 0.1%
48.56 5
 
< 0.1%
132.33 5
 
< 0.1%
158.83 4
 
< 0.1%
58 4
 
< 0.1%
62.3 4
 
< 0.1%
79.66 4
 
< 0.1%
61.5 4
 
< 0.1%
Other values (8940) 10568
 
0.5%
(Missing) 2250055
99.5%
ValueCountFrequency (%)
0.64 1
< 0.1%
1.47 1
< 0.1%
1.61 1
< 0.1%
2.02 1
< 0.1%
2.15 1
< 0.1%
ValueCountFrequency (%)
943.94 1
< 0.1%
923.4 1
< 0.1%
893.63 1
< 0.1%
893.05 1
< 0.1%
845.22 1
< 0.1%

hardship_dpd
Real number (ℝ)

Distinct34
Distinct (%)0.3%
Missing2250055
Missing (%)99.5%
Infinite0
Infinite (%)0.0%
Mean13.68642231
Minimum0
Maximum37
Zeros2402
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:38.261969image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q15
median15
Q322
95-th percentile28
Maximum37
Range37
Interquartile range (IQR)17

Descriptive statistics

Standard deviation9.7281384
Coefficient of variation (CV)0.7107875366
Kurtosis-1.312529432
Mean13.68642231
Median Absolute Deviation (MAD)8
Skewness-0.1242211405
Sum145254
Variance94.63667673
MonotonicityNot monotonic
2023-04-16T23:54:38.341262image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=34)
ValueCountFrequency (%)
0 2402
 
0.1%
23 410
 
< 0.1%
26 409
 
< 0.1%
25 405
 
< 0.1%
20 380
 
< 0.1%
16 372
 
< 0.1%
27 361
 
< 0.1%
11 359
 
< 0.1%
17 359
 
< 0.1%
22 357
 
< 0.1%
Other values (24) 4799
 
0.2%
(Missing) 2250055
99.5%
ValueCountFrequency (%)
0 2402
0.1%
1 43
 
< 0.1%
2 49
 
< 0.1%
3 61
 
< 0.1%
4 78
 
< 0.1%
ValueCountFrequency (%)
37 1
 
< 0.1%
32 4
 
< 0.1%
31 2
 
< 0.1%
30 29
 
< 0.1%
29 268
< 0.1%

hardship_end_date
Categorical

IMBALANCE  MISSING 

Distinct28
Distinct (%)< 0.1%
Missing95321
Missing (%)4.2%
Memory size17.2 MiB
2154734 
Dec-2017
 
1756
Nov-2017
 
1325
Jan-2018
 
749
Jan-2019
 
518
Other values (23)
 
6265

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
2154734
95.3%
Dec-2017 1756
 
0.1%
Nov-2017 1325
 
0.1%
Jan-2018 749
 
< 0.1%
Jan-2019 518
 
< 0.1%
Dec-2018 509
 
< 0.1%
Feb-2019 471
 
< 0.1%
Nov-2018 413
 
< 0.1%
Feb-2018 401
 
< 0.1%
Oct-2018 397
 
< 0.1%
Other values (18) 4074
 
0.2%
(Missing) 95321
 
4.2%

hardship_flag
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.2 MiB
N
2259783 
Y
 
885

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowN
2nd rowN
3rd rowN
4th rowN
5th rowN

Common Values

ValueCountFrequency (%)
N 2259783
> 99.9%
Y 885
 
< 0.1%

Common Values (Plot)

2023-04-16T23:54:38.420186image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Distinct8795
Distinct (%)82.9%
Missing2250055
Missing (%)99.5%
Infinite0
Infinite (%)0.0%
Mean193.6063309
Minimum0.01
Maximum1407.86
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:38.485143image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0.01
5-th percentile0.35
Q143.78
median132.89
Q3284.18
95-th percentile596.41
Maximum1407.86
Range1407.85
Interquartile range (IQR)240.4

Descriptive statistics

Standard deviation198.6943679
Coefficient of variation (CV)1.026280323
Kurtosis3.212550008
Mean193.6063309
Median Absolute Deviation (MAD)105.35
Skewness1.620484485
Sum2054743.99
Variance39479.45183
MonotonicityNot monotonic
2023-04-16T23:54:38.578617image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.03 25
 
< 0.1%
0.02 25
 
< 0.1%
0.12 23
 
< 0.1%
0.11 23
 
< 0.1%
0.1 21
 
< 0.1%
0.05 21
 
< 0.1%
0.04 20
 
< 0.1%
0.07 18
 
< 0.1%
0.09 18
 
< 0.1%
0.13 17
 
< 0.1%
Other values (8785) 10402
 
0.5%
(Missing) 2250055
99.5%
ValueCountFrequency (%)
0.01 16
< 0.1%
0.02 25
< 0.1%
0.03 25
< 0.1%
0.04 20
< 0.1%
0.05 21
< 0.1%
ValueCountFrequency (%)
1407.86 1
< 0.1%
1377.17 1
< 0.1%
1291.21 1
< 0.1%
1290.59 1
< 0.1%
1283.9 1
< 0.1%

hardship_length
Real number (ℝ)

CONSTANT  MISSING 

Distinct1
Distinct (%)< 0.1%
Missing2250055
Missing (%)99.5%
Infinite0
Infinite (%)0.0%
Mean3
Minimum3
Maximum3
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:38.659863image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum3
5-th percentile3
Q13
median3
Q33
95-th percentile3
Maximum3
Range0
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0
Coefficient of variation (CV)0
Kurtosis0
Mean3
Median Absolute Deviation (MAD)0
Skewness0
Sum31839
Variance0
MonotonicityIncreasing
2023-04-16T23:54:38.718754image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=1)
ValueCountFrequency (%)
3 10613
 
0.5%
(Missing) 2250055
99.5%
ValueCountFrequency (%)
3 10613
0.5%
ValueCountFrequency (%)
3 10613
0.5%

hardship_loan_status
Categorical

IMBALANCE  MISSING 

Distinct6
Distinct (%)< 0.1%
Missing95321
Missing (%)4.2%
Memory size17.2 MiB
2154734 
Late (16-30 days)
 
4622
In Grace Period
 
2806
Current
 
2737
Late (31-120 days)
 
433

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
2154734
95.3%
Late (16-30 days) 4622
 
0.2%
In Grace Period 2806
 
0.1%
Current 2737
 
0.1%
Late (31-120 days) 433
 
< 0.1%
Issued 15
 
< 0.1%
(Missing) 95321
 
4.2%

Common Values (Plot)

2023-04-16T23:54:38.802014image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Distinct10591
Distinct (%)99.8%
Missing2250055
Missing (%)99.5%
Infinite0
Infinite (%)0.0%
Mean11628.03644
Minimum55.73
Maximum40306.41
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:38.882378image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum55.73
5-th percentile2177.796
Q15628.73
median10044.22
Q316114.94
95-th percentile26839.01
Maximum40306.41
Range40250.68
Interquartile range (IQR)10486.21

Descriptive statistics

Standard deviation7615.161123
Coefficient of variation (CV)0.6548965649
Kurtosis0.2100018537
Mean11628.03644
Median Absolute Deviation (MAD)4996.94
Skewness0.8660260461
Sum123408350.8
Variance57990678.92
MonotonicityNot monotonic
2023-04-16T23:54:38.978848image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
8234.71 2
 
< 0.1%
11759.5 2
 
< 0.1%
5131.62 2
 
< 0.1%
10747.75 2
 
< 0.1%
4775.7 2
 
< 0.1%
10797.54 2
 
< 0.1%
12031.29 2
 
< 0.1%
15557.86 2
 
< 0.1%
6627.94 2
 
< 0.1%
8254.12 2
 
< 0.1%
Other values (10581) 10593
 
0.5%
(Missing) 2250055
99.5%
ValueCountFrequency (%)
55.73 1
< 0.1%
174.15 1
< 0.1%
191.12 1
< 0.1%
193.98 1
< 0.1%
206.97 1
< 0.1%
ValueCountFrequency (%)
40306.41 1
< 0.1%
40149.35 1
< 0.1%
39746.94 1
< 0.1%
39542.45 1
< 0.1%
38824.41 1
< 0.1%

hardship_reason
Categorical

IMBALANCE  MISSING 

Distinct10
Distinct (%)< 0.1%
Missing95321
Missing (%)4.2%
Memory size17.2 MiB
2154734 
NATURAL_DISASTER
 
2965
EXCESSIVE_OBLIGATIONS
 
2079
UNEMPLOYMENT
 
1834
INCOME_CURTAILMENT
 
1279
Other values (5)
 
2456

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
2154734
95.3%
NATURAL_DISASTER 2965
 
0.1%
EXCESSIVE_OBLIGATIONS 2079
 
0.1%
UNEMPLOYMENT 1834
 
0.1%
INCOME_CURTAILMENT 1279
 
0.1%
MEDICAL 1249
 
0.1%
REDUCED_HOURS 629
 
< 0.1%
DIVORCE 218
 
< 0.1%
FAMILY_DEATH 206
 
< 0.1%
DISABILITY 154
 
< 0.1%
(Missing) 95321
 
4.2%

Common Values (Plot)

2023-04-16T23:54:39.088543image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

hardship_start_date
Categorical

IMBALANCE  MISSING 

Distinct27
Distinct (%)< 0.1%
Missing95321
Missing (%)4.2%
Memory size17.2 MiB
2154734 
Sep-2017
 
2444
Oct-2017
 
1077
Oct-2018
 
594
Nov-2017
 
466
Other values (22)
 
6032

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
2154734
95.3%
Sep-2017 2444
 
0.1%
Oct-2017 1077
 
< 0.1%
Oct-2018 594
 
< 0.1%
Nov-2017 466
 
< 0.1%
Aug-2018 463
 
< 0.1%
Jan-2019 431
 
< 0.1%
Sep-2018 422
 
< 0.1%
Nov-2018 420
 
< 0.1%
Jun-2017 400
 
< 0.1%
Other values (17) 3896
 
0.2%
(Missing) 95321
 
4.2%

hardship_status
Categorical

IMBALANCE  MISSING 

Distinct4
Distinct (%)< 0.1%
Missing95321
Missing (%)4.2%
Memory size17.2 MiB
2154734 
COMPLETED
 
7541
BROKEN
 
2187
ACTIVE
 
885

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
2154734
95.3%
COMPLETED 7541
 
0.3%
BROKEN 2187
 
0.1%
ACTIVE 885
 
< 0.1%
(Missing) 95321
 
4.2%

Common Values (Plot)

2023-04-16T23:54:39.200840image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

hardship_type
Categorical

IMBALANCE  MISSING 

Distinct2
Distinct (%)< 0.1%
Missing95321
Missing (%)4.2%
Memory size17.2 MiB
2154734 
INTEREST ONLY-3 MONTHS DEFERRAL
 
10613

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
2154734
95.3%
INTEREST ONLY-3 MONTHS DEFERRAL 10613
 
0.5%
(Missing) 95321
 
4.2%

Common Values (Plot)

2023-04-16T23:54:39.275923image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

home_ownership
Categorical

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.2 MiB
MORTGAGE
1111450 
RENT
894929 
OWN
253057 
ANY
 
996
OTHER
 
182

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowRENT
2nd rowMORTGAGE
3rd rowMORTGAGE
4th rowMORTGAGE
5th rowMORTGAGE

Common Values

ValueCountFrequency (%)
MORTGAGE 1111450
49.2%
RENT 894929
39.6%
OWN 253057
 
11.2%
ANY 996
 
< 0.1%
OTHER 182
 
< 0.1%
NONE 54
 
< 0.1%

Common Values (Plot)

2023-04-16T23:54:39.355464image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

id
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2260668
Missing (%)100.0%
Memory size17.2 MiB

il_util
Real number (ℝ)

Distinct280
Distinct (%)< 0.1%
Missing1068850
Missing (%)47.3%
Infinite0
Infinite (%)0.0%
Mean69.14097958
Minimum0
Maximum1000
Zeros6629
Zeros (%)0.3%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:39.449284image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile25
Q155
median72
Q386
95-th percentile101
Maximum1000
Range1000
Interquartile range (IQR)31

Descriptive statistics

Standard deviation23.74838634
Coefficient of variation (CV)0.3434777246
Kurtosis7.081786181
Mean69.14097958
Median Absolute Deviation (MAD)15
Skewness-0.1724200621
Sum82403464
Variance563.9858538
MonotonicityNot monotonic
2023-04-16T23:54:39.530344image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
78 22659
 
1.0%
75 22425
 
1.0%
81 22330
 
1.0%
83 22286
 
1.0%
72 22023
 
1.0%
77 21852
 
1.0%
80 21737
 
1.0%
82 21553
 
1.0%
74 21496
 
1.0%
79 21478
 
1.0%
Other values (270) 971979
43.0%
(Missing) 1068850
47.3%
ValueCountFrequency (%)
0 6629
0.3%
1 439
 
< 0.1%
2 886
 
< 0.1%
3 1761
 
0.1%
4 1112
 
< 0.1%
ValueCountFrequency (%)
1000 3
< 0.1%
558 1
 
< 0.1%
464 1
 
< 0.1%
428 1
 
< 0.1%
417 1
 
< 0.1%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.2 MiB
w
1535467 
f
725201 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st roww
2nd roww
3rd roww
4th roww
5th roww

Common Values

ValueCountFrequency (%)
w 1535467
67.9%
f 725201
32.1%

Common Values (Plot)

2023-04-16T23:54:39.620186image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

inq_fi
Real number (ℝ)

MISSING  ZEROS 

Distinct33
Distinct (%)< 0.1%
Missing866129
Missing (%)38.3%
Infinite0
Infinite (%)0.0%
Mean1.012866618
Minimum0
Maximum48
Zeros697142
Zeros (%)30.8%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:39.687441image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q31
95-th percentile4
Maximum48
Range48
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.489455668
Coefficient of variation (CV)1.470534858
Kurtosis13.46503817
Mean1.012866618
Median Absolute Deviation (MAD)1
Skewness2.665084535
Sum1412482
Variance2.218478187
MonotonicityNot monotonic
2023-04-16T23:54:39.776458image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=33)
ValueCountFrequency (%)
0 697142
30.8%
1 352169
15.6%
2 172982
 
7.7%
3 83887
 
3.7%
4 41460
 
1.8%
5 21374
 
0.9%
6 11043
 
0.5%
7 6059
 
0.3%
8 3423
 
0.2%
9 1946
 
0.1%
Other values (23) 3054
 
0.1%
(Missing) 866129
38.3%
ValueCountFrequency (%)
0 697142
30.8%
1 352169
15.6%
2 172982
 
7.7%
3 83887
 
3.7%
4 41460
 
1.8%
ValueCountFrequency (%)
48 1
< 0.1%
38 1
< 0.1%
32 1
< 0.1%
31 1
< 0.1%
29 1
< 0.1%

inq_last_12m
Real number (ℝ)

MISSING  ZEROS 

Distinct48
Distinct (%)< 0.1%
Missing866130
Missing (%)38.3%
Infinite0
Infinite (%)0.0%
Mean2.036667341
Minimum0
Maximum67
Zeros400090
Zeros (%)17.7%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:39.858962image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q33
95-th percentile7
Maximum67
Range67
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.383117224
Coefficient of variation (CV)1.170106269
Kurtosis11.22465969
Mean2.036667341
Median Absolute Deviation (MAD)1
Skewness2.405157988
Sum2840210
Variance5.679247702
MonotonicityNot monotonic
2023-04-16T23:54:40.450794image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=48)
ValueCountFrequency (%)
0 400090
17.7%
1 339411
 
15.0%
2 235736
 
10.4%
3 153565
 
6.8%
4 96535
 
4.3%
5 59861
 
2.6%
6 37765
 
1.7%
7 24088
 
1.1%
8 15619
 
0.7%
9 10201
 
0.5%
Other values (38) 21667
 
1.0%
(Missing) 866130
38.3%
ValueCountFrequency (%)
0 400090
17.7%
1 339411
15.0%
2 235736
10.4%
3 153565
 
6.8%
4 96535
 
4.3%
ValueCountFrequency (%)
67 1
< 0.1%
51 1
< 0.1%
49 1
< 0.1%
46 1
< 0.1%
45 1
< 0.1%

inq_last_6mths
Real number (ℝ)

Distinct28
Distinct (%)< 0.1%
Missing30
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean0.5768353889
Minimum0
Maximum33
Zeros1381722
Zeros (%)61.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:40.542233image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile2
Maximum33
Range33
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.8859631584
Coefficient of variation (CV)1.53590292
Kurtosis9.57608952
Mean0.5768353889
Median Absolute Deviation (MAD)0
Skewness2.066186683
Sum1304016
Variance0.784930718
MonotonicityNot monotonic
2023-04-16T23:54:40.611963image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=28)
ValueCountFrequency (%)
0 1381722
61.1%
1 584390
25.9%
2 200212
 
8.9%
3 69009
 
3.1%
4 17380
 
0.8%
5 6232
 
0.3%
6 1231
 
0.1%
7 195
 
< 0.1%
8 122
 
< 0.1%
9 50
 
< 0.1%
Other values (18) 95
 
< 0.1%
(Missing) 30
 
< 0.1%
ValueCountFrequency (%)
0 1381722
61.1%
1 584390
25.9%
2 200212
 
8.9%
3 69009
 
3.1%
4 17380
 
0.8%
ValueCountFrequency (%)
33 1
< 0.1%
32 1
< 0.1%
31 1
< 0.1%
28 1
< 0.1%
27 1
< 0.1%

installment
Real number (ℝ)

Distinct93296
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean445.8076459
Minimum4.93
Maximum1719.83
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:40.711543image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum4.93
5-th percentile110.43
Q1251.65
median377.99
Q3593.32
95-th percentile984.47
Maximum1719.83
Range1714.9
Interquartile range (IQR)341.67

Descriptive statistics

Standard deviation267.173725
Coefficient of variation (CV)0.599302698
Kurtosis0.6898708652
Mean445.8076459
Median Absolute Deviation (MAD)157.63
Skewness1.001778382
Sum1007823079
Variance71381.79933
MonotonicityNot monotonic
2023-04-16T23:54:40.820197image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
301.15 4420
 
0.2%
332.1 4153
 
0.2%
361.38 3704
 
0.2%
327.34 3353
 
0.1%
602.3 3095
 
0.1%
451.73 3076
 
0.1%
329.72 2614
 
0.1%
166.05 2508
 
0.1%
498.15 2410
 
0.1%
180.69 2364
 
0.1%
Other values (93286) 2228971
98.6%
ValueCountFrequency (%)
4.93 1
< 0.1%
7.61 1
< 0.1%
14.01 1
< 0.1%
14.77 1
< 0.1%
15.67 1
< 0.1%
ValueCountFrequency (%)
1719.83 2
 
< 0.1%
1717.63 1
 
< 0.1%
1715.42 2
 
< 0.1%
1714.54 6
< 0.1%
1691.28 2
 
< 0.1%

int_rate
Real number (ℝ)

Distinct673
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13.09291294
Minimum5.31
Maximum30.99
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:40.929370image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum5.31
5-th percentile6.49
Q19.49
median12.62
Q315.99
95-th percentile22.15
Maximum30.99
Range25.68
Interquartile range (IQR)6.5

Descriptive statistics

Standard deviation4.832114233
Coefficient of variation (CV)0.3690633439
Kurtosis0.5940480996
Mean13.09291294
Median Absolute Deviation (MAD)3.18
Skewness0.7680743803
Sum29598729.32
Variance23.34932796
MonotonicityNot monotonic
2023-04-16T23:54:41.023806image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
11.99 53869
 
2.4%
5.32 47171
 
2.1%
10.99 44165
 
2.0%
13.99 43026
 
1.9%
11.49 32009
 
1.4%
16.99 30564
 
1.4%
12.99 29276
 
1.3%
7.89 28515
 
1.3%
9.17 27835
 
1.2%
15.61 25208
 
1.1%
Other values (663) 1899030
84.0%
ValueCountFrequency (%)
5.31 8613
 
0.4%
5.32 47171
2.1%
5.42 573
 
< 0.1%
5.79 410
 
< 0.1%
5.93 1812
 
0.1%
ValueCountFrequency (%)
30.99 819
< 0.1%
30.94 733
< 0.1%
30.89 699
< 0.1%
30.84 755
< 0.1%
30.79 1572
0.1%

issue_d
Categorical

Distinct139
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.2 MiB
Mar-2016
 
61992
Oct-2015
 
48631
May-2018
 
46311
Oct-2018
 
46305
Aug-2018
 
46079
Other values (134)
2011350 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowDec-2018
2nd rowDec-2018
3rd rowDec-2018
4th rowDec-2018
5th rowDec-2018

Common Values

ValueCountFrequency (%)
Mar-2016 61992
 
2.7%
Oct-2015 48631
 
2.2%
May-2018 46311
 
2.0%
Oct-2018 46305
 
2.0%
Aug-2018 46079
 
2.0%
Jul-2015 45962
 
2.0%
Dec-2015 44343
 
2.0%
Aug-2017 43573
 
1.9%
Jul-2018 43089
 
1.9%
Apr-2018 42928
 
1.9%
Other values (129) 1791455
79.2%

last_credit_pull_d
Categorical

HIGH CARDINALITY  IMBALANCE 

Distinct141
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.2 MiB
Feb-2019
1398266 
Jan-2019
 
78011
Jul-2018
 
56612
Dec-2018
 
52077
Oct-2018
 
51366
Other values (136)
624336 

Unique

Unique3 ?
Unique (%)< 0.1%

Sample

1st rowFeb-2019
2nd rowFeb-2019
3rd rowFeb-2019
4th rowFeb-2019
5th rowFeb-2019

Common Values

ValueCountFrequency (%)
Feb-2019 1398266
61.9%
Jan-2019 78011
 
3.5%
Jul-2018 56612
 
2.5%
Dec-2018 52077
 
2.3%
Oct-2018 51366
 
2.3%
Oct-2016 51262
 
2.3%
Nov-2018 48552
 
2.1%
Aug-2018 45958
 
2.0%
Sep-2018 36225
 
1.6%
May-2018 26685
 
1.2%
Other values (131) 415654
 
18.4%

last_pymnt_amnt
Real number (ℝ)

Distinct692560
Distinct (%)30.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3364.015261
Minimum0
Maximum42192.05
Zeros2876
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:41.123709image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile101.07
Q1308.64
median588.47
Q33534.965
95-th percentile16766.5495
Maximum42192.05
Range42192.05
Interquartile range (IQR)3226.325

Descriptive statistics

Standard deviation5971.757409
Coefficient of variation (CV)1.775187372
Kurtosis7.299220753
Mean3364.015261
Median Absolute Deviation (MAD)383
Skewness2.603471102
Sum7604921652
Variance35661886.55
MonotonicityNot monotonic
2023-04-16T23:54:41.213979image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
50 3082
 
0.1%
0 2876
 
0.1%
100 2061
 
0.1%
301.15 1967
 
0.1%
332.1 1911
 
0.1%
361.38 1868
 
0.1%
309.74 1549
 
0.1%
320.05 1489
 
0.1%
324.65 1479
 
0.1%
304.72 1400
 
0.1%
Other values (692550) 2240986
99.1%
ValueCountFrequency (%)
0 2876
0.1%
0.01 572
 
< 0.1%
0.02 90
 
< 0.1%
0.03 80
 
< 0.1%
0.04 73
 
< 0.1%
ValueCountFrequency (%)
42192.05 1
< 0.1%
42148.53 1
< 0.1%
42005.2 1
< 0.1%
41453.07 1
< 0.1%
41434 1
< 0.1%

last_pymnt_d
Categorical

Distinct136
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.2 MiB
Feb-2019
934725 
Jan-2019
 
52576
Aug-2018
 
39615
Mar-2018
 
38269
Oct-2018
 
37468
Other values (131)
1158015 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowFeb-2019
2nd rowFeb-2019
3rd rowFeb-2019
4th rowFeb-2019
5th rowFeb-2019

Common Values

ValueCountFrequency (%)
Feb-2019 934725
41.3%
Jan-2019 52576
 
2.3%
Aug-2018 39615
 
1.8%
Mar-2018 38269
 
1.7%
Oct-2018 37468
 
1.7%
Jul-2018 36497
 
1.6%
Nov-2018 35714
 
1.6%
Jun-2018 35168
 
1.6%
May-2018 33456
 
1.5%
Dec-2018 33451
 
1.5%
Other values (126) 983729
43.5%

loan_amnt
Real number (ℝ)

Distinct1572
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15046.93123
Minimum500
Maximum40000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.6 MiB
2023-04-16T23:54:41.307471image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum500
5-th percentile3250
Q18000
median12900
Q320000
95-th percentile35000
Maximum40000
Range39500
Interquartile range (IQR)12000

Descriptive statistics

Standard deviation9190.245488
Coefficient of variation (CV)0.6107720803
Kurtosis-0.1194391577
Mean15046.93123
Median Absolute Deviation (MAD)6200
Skewness0.7777823287
Sum3.401611592 × 1010
Variance84460612.13
MonotonicityNot monotonic
2023-04-16T23:54:41.402049image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10000 187236
 
8.3%
20000 131006
 
5.8%
15000 123226
 
5.5%
12000 121681
 
5.4%
35000 86285
 
3.8%
5000 84765
 
3.7%
8000 75033
 
3.3%
6000 72089
 
3.2%
25000 66453
 
2.9%
16000 66418
 
2.9%
Other values (1562) 1246476
55.1%
ValueCountFrequency (%)
500 11
< 0.1%
550 1
 
< 0.1%
600 6
< 0.1%
700 3
 
< 0.1%
725 1
 
< 0.1%
ValueCountFrequency (%)
40000 33368
1.5%
39975 11
 
< 0.1%
39950 10
 
< 0.1%
39925 14
 
< 0.1%
39900 24
 
< 0.1%

loan_status
Categorical

Distinct9
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.2 MiB
Fully Paid
1041952 
Current
919695 
Charged Off
261655 
Late (31-120 days)
 
21897
In Grace Period
 
8952
Other values (4)
 
6517

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowCurrent
2nd rowCurrent
3rd rowCurrent
4th rowCurrent
5th rowCurrent

Common Values

ValueCountFrequency (%)
Fully Paid 1041952
46.1%
Current 919695
40.7%
Charged Off 261655
 
11.6%
Late (31-120 days) 21897
 
1.0%
In Grace Period 8952
 
0.4%
Late (16-30 days) 3737
 
0.2%
Does not meet the credit policy. Status:Fully Paid 1988
 
0.1%
Does not meet the credit policy. Status:Charged Off 761
 
< 0.1%
Default 31
 
< 0.1%

Common Values (Plot)

2023-04-16T23:54:41.513393image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

max_bal_bc
Real number (ℝ)

MISSING  ZEROS 

Distinct33726
Distinct (%)2.4%
Missing866129
Missing (%)38.3%
Infinite0
Infinite (%)0.0%
Mean5806.392905
Minimum0
Maximum1170668
Zeros35917
Zeros (%)1.6%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:41.616504image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile340
Q12284
median4413
Q37598
95-th percentile16311
Maximum1170668
Range1170668
Interquartile range (IQR)5314

Descriptive statistics

Standard deviation5690.561012
Coefficient of variation (CV)0.9800509723
Kurtosis1748.219344
Mean5806.392905
Median Absolute Deviation (MAD)2462
Skewness13.69539884
Sum8097241355
Variance32382484.63
MonotonicityNot monotonic
2023-04-16T23:54:41.715754image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 35917
 
1.6%
8 572
 
< 0.1%
3000 533
 
< 0.1%
2000 504
 
< 0.1%
4000 447
 
< 0.1%
5000 423
 
< 0.1%
2500 383
 
< 0.1%
1900 333
 
< 0.1%
1500 333
 
< 0.1%
3500 330
 
< 0.1%
Other values (33716) 1354764
59.9%
(Missing) 866129
38.3%
ValueCountFrequency (%)
0 35917
1.6%
1 165
 
< 0.1%
2 218
 
< 0.1%
3 234
 
< 0.1%
4 217
 
< 0.1%
ValueCountFrequency (%)
1170668 1
< 0.1%
776843 1
< 0.1%
571793 1
< 0.1%
500000 1
< 0.1%
457521 1
< 0.1%

member_id
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2260668
Missing (%)100.0%
Memory size17.2 MiB

mo_sin_old_il_acct
Real number (ℝ)

Distinct566
Distinct (%)< 0.1%
Missing139071
Missing (%)6.2%
Infinite0
Infinite (%)0.0%
Mean125.7377608
Minimum0
Maximum999
Zeros16
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:41.823223image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile31
Q196
median130
Q3154
95-th percentile213
Maximum999
Range999
Interquartile range (IQR)58

Descriptive statistics

Standard deviation53.38217542
Coefficient of variation (CV)0.4245516629
Kurtosis1.843977671
Mean125.7377608
Median Absolute Deviation (MAD)27
Skewness0.3513513484
Sum266764856
Variance2849.656652
MonotonicityNot monotonic
2023-04-16T23:54:41.916695image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
130 25017
 
1.1%
129 24849
 
1.1%
128 24798
 
1.1%
132 24767
 
1.1%
127 24749
 
1.1%
133 24731
 
1.1%
125 24683
 
1.1%
126 24649
 
1.1%
134 24574
 
1.1%
131 24515
 
1.1%
Other values (556) 1874265
82.9%
(Missing) 139071
 
6.2%
ValueCountFrequency (%)
0 16
 
< 0.1%
1 507
 
< 0.1%
2 1003
< 0.1%
3 1414
0.1%
4 1616
0.1%
ValueCountFrequency (%)
999 2
< 0.1%
848 1
< 0.1%
827 1
< 0.1%
822 1
< 0.1%
808 1
< 0.1%

mo_sin_old_rev_tl_op
Real number (ℝ)

Distinct787
Distinct (%)< 0.1%
Missing70277
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean181.4915675
Minimum1
Maximum999
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:42.002587image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile49
Q1116
median164
Q3232
95-th percentile369
Maximum999
Range998
Interquartile range (IQR)116

Descriptive statistics

Standard deviation97.11845373
Coefficient of variation (CV)0.5351127608
Kurtosis1.355518547
Mean181.4915675
Median Absolute Deviation (MAD)56
Skewness1.007808058
Sum397537496
Variance9431.994056
MonotonicityNot monotonic
2023-04-16T23:54:42.099821image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
132 12536
 
0.6%
136 12528
 
0.6%
131 12386
 
0.5%
134 12317
 
0.5%
130 12299
 
0.5%
133 12278
 
0.5%
135 12235
 
0.5%
137 12132
 
0.5%
129 12116
 
0.5%
140 12112
 
0.5%
Other values (777) 2067452
91.5%
(Missing) 70277
 
3.1%
ValueCountFrequency (%)
1 2
 
< 0.1%
2 6
 
< 0.1%
3 17
< 0.1%
4 19
< 0.1%
5 42
< 0.1%
ValueCountFrequency (%)
999 1
< 0.1%
901 1
< 0.1%
852 1
< 0.1%
851 1
< 0.1%
842 1
< 0.1%

mo_sin_rcnt_rev_tl_op
Real number (ℝ)

MISSING  ZEROS 

Distinct333
Distinct (%)< 0.1%
Missing70277
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean14.02408931
Minimum0
Maximum547
Zeros33747
Zeros (%)1.5%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:42.189491image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q14
median8
Q317
95-th percentile46
Maximum547
Range547
Interquartile range (IQR)13

Descriptive statistics

Standard deviation17.53308255
Coefficient of variation (CV)1.250211844
Kurtosis21.96014259
Mean14.02408931
Median Absolute Deviation (MAD)5
Skewness3.592705204
Sum30718239
Variance307.4089838
MonotonicityNot monotonic
2023-04-16T23:54:42.284185image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2 167693
 
7.4%
3 161273
 
7.1%
4 147691
 
6.5%
1 137574
 
6.1%
5 132964
 
5.9%
6 117844
 
5.2%
7 107901
 
4.8%
8 96426
 
4.3%
9 85716
 
3.8%
10 78061
 
3.5%
Other values (323) 957248
42.3%
ValueCountFrequency (%)
0 33747
 
1.5%
1 137574
6.1%
2 167693
7.4%
3 161273
7.1%
4 147691
6.5%
ValueCountFrequency (%)
547 1
< 0.1%
502 1
< 0.1%
438 1
< 0.1%
406 1
< 0.1%
404 1
< 0.1%

mo_sin_rcnt_tl
Real number (ℝ)

MISSING  ZEROS 

Distinct232
Distinct (%)< 0.1%
Missing70276
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean8.297468672
Minimum0
Maximum382
Zeros34923
Zeros (%)1.5%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:42.379988image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q13
median6
Q311
95-th percentile24
Maximum382
Range382
Interquartile range (IQR)8

Descriptive statistics

Standard deviation9.208556539
Coefficient of variation (CV)1.10980311
Kurtosis46.21937168
Mean8.297468672
Median Absolute Deviation (MAD)3
Skewness4.601065685
Sum18174709
Variance84.79751352
MonotonicityNot monotonic
2023-04-16T23:54:42.473899image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2 235575
10.4%
3 228153
 
10.1%
4 203278
 
9.0%
1 180626
 
8.0%
5 175501
 
7.8%
6 151978
 
6.7%
7 135550
 
6.0%
8 114936
 
5.1%
9 96422
 
4.3%
10 82385
 
3.6%
Other values (222) 585988
25.9%
ValueCountFrequency (%)
0 34923
 
1.5%
1 180626
8.0%
2 235575
10.4%
3 228153
10.1%
4 203278
9.0%
ValueCountFrequency (%)
382 1
< 0.1%
368 1
< 0.1%
353 1
< 0.1%
331 1
< 0.1%
314 1
< 0.1%

mort_acc
Real number (ℝ)

MISSING  ZEROS 

Distinct47
Distinct (%)< 0.1%
Missing50030
Missing (%)2.2%
Infinite0
Infinite (%)0.0%
Mean1.555382202
Minimum0
Maximum94
Zeros929606
Zeros (%)41.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:42.583474image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q33
95-th percentile5
Maximum94
Range94
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.904981017
Coefficient of variation (CV)1.224767144
Kurtosis10.50475648
Mean1.555382202
Median Absolute Deviation (MAD)1
Skewness1.789683755
Sum3438387
Variance3.628952676
MonotonicityNot monotonic
2023-04-16T23:54:42.677262image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=47)
ValueCountFrequency (%)
0 929606
41.1%
1 393270
17.4%
2 325903
 
14.4%
3 231066
 
10.2%
4 150002
 
6.6%
5 86666
 
3.8%
6 46804
 
2.1%
7 23419
 
1.0%
8 11450
 
0.5%
9 5742
 
0.3%
Other values (37) 6710
 
0.3%
(Missing) 50030
 
2.2%
ValueCountFrequency (%)
0 929606
41.1%
1 393270
17.4%
2 325903
 
14.4%
3 231066
 
10.2%
4 150002
 
6.6%
ValueCountFrequency (%)
94 1
< 0.1%
87 1
< 0.1%
61 1
< 0.1%
52 1
< 0.1%
51 1
< 0.1%

mths_since_last_delinq
Real number (ℝ)

Distinct173
Distinct (%)< 0.1%
Missing1158502
Missing (%)51.2%
Infinite0
Infinite (%)0.0%
Mean34.5409158
Minimum0
Maximum226
Zeros2637
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:42.756120image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile5
Q116
median31
Q350
95-th percentile74
Maximum226
Range226
Interquartile range (IQR)34

Descriptive statistics

Standard deviation21.9004709
Coefficient of variation (CV)0.6340443035
Kurtosis-0.6936426553
Mean34.5409158
Median Absolute Deviation (MAD)17
Skewness0.4597452469
Sum38069823
Variance479.6306255
MonotonicityNot monotonic
2023-04-16T23:54:42.853617image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
12 20967
 
0.9%
13 20629
 
0.9%
6 20615
 
0.9%
9 20345
 
0.9%
7 20139
 
0.9%
14 19913
 
0.9%
8 19575
 
0.9%
15 19555
 
0.9%
10 19371
 
0.9%
18 19042
 
0.8%
Other values (163) 902015
39.9%
(Missing) 1158502
51.2%
ValueCountFrequency (%)
0 2637
 
0.1%
1 7100
0.3%
2 9853
0.4%
3 13089
0.6%
4 15506
0.7%
ValueCountFrequency (%)
226 1
< 0.1%
202 1
< 0.1%
195 1
< 0.1%
192 1
< 0.1%
188 2
< 0.1%
Distinct183
Distinct (%)< 0.1%
Missing1679893
Missing (%)74.3%
Infinite0
Infinite (%)0.0%
Mean44.16422022
Minimum0
Maximum226
Zeros375
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:42.948539image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile10
Q127
median44
Q362
95-th percentile77
Maximum226
Range226
Interquartile range (IQR)35

Descriptive statistics

Standard deviation21.53312059
Coefficient of variation (CV)0.4875693601
Kurtosis-0.5492963164
Mean44.16422022
Median Absolute Deviation (MAD)17
Skewness0.09211592216
Sum25649475
Variance463.6752825
MonotonicityNot monotonic
2023-04-16T23:54:43.041540image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
45 9181
 
0.4%
43 9126
 
0.4%
42 9096
 
0.4%
44 9082
 
0.4%
46 9042
 
0.4%
48 8978
 
0.4%
40 8962
 
0.4%
41 8881
 
0.4%
47 8866
 
0.4%
38 8864
 
0.4%
Other values (173) 490697
 
21.7%
(Missing) 1679893
74.3%
ValueCountFrequency (%)
0 375
 
< 0.1%
1 1338
0.1%
2 1575
0.1%
3 1968
0.1%
4 2632
0.1%
ValueCountFrequency (%)
226 1
< 0.1%
202 1
< 0.1%
197 1
< 0.1%
195 1
< 0.1%
192 1
< 0.1%

mths_since_last_record
Real number (ℝ)

Distinct129
Distinct (%)< 0.1%
Missing1901512
Missing (%)84.1%
Infinite0
Infinite (%)0.0%
Mean72.31284177
Minimum0
Maximum129
Zeros1296
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:43.127082image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile24
Q155
median74
Q392
95-th percentile113
Maximum129
Range129
Interquartile range (IQR)37

Descriptive statistics

Standard deviation26.46409448
Coefficient of variation (CV)0.3659667333
Kurtosis-0.4147973573
Mean72.31284177
Median Absolute Deviation (MAD)19
Skewness-0.3692538065
Sum25971591
Variance700.3482964
MonotonicityNot monotonic
2023-04-16T23:54:43.227495image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
79 5448
 
0.2%
80 5434
 
0.2%
82 5405
 
0.2%
77 5356
 
0.2%
76 5326
 
0.2%
81 5278
 
0.2%
78 5263
 
0.2%
75 5262
 
0.2%
72 5176
 
0.2%
71 5164
 
0.2%
Other values (119) 306044
 
13.5%
(Missing) 1901512
84.1%
ValueCountFrequency (%)
0 1296
0.1%
1 162
 
< 0.1%
2 180
 
< 0.1%
3 316
 
< 0.1%
4 385
 
< 0.1%
ValueCountFrequency (%)
129 1
 
< 0.1%
127 1
 
< 0.1%
126 4
 
< 0.1%
125 3
 
< 0.1%
124 13
< 0.1%

mths_since_rcnt_il
Real number (ℝ)

Distinct405
Distinct (%)< 0.1%
Missing909924
Missing (%)40.3%
Infinite0
Infinite (%)0.0%
Mean21.22235672
Minimum0
Maximum511
Zeros653
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:43.333720image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2
Q17
median13
Q324
95-th percentile72
Maximum511
Range511
Interquartile range (IQR)17

Descriptive statistics

Standard deviation26.0491867
Coefficient of variation (CV)1.227440809
Kurtosis17.41971902
Mean21.22235672
Median Absolute Deviation (MAD)8
Skewness3.432616292
Sum28665971
Variance678.5601279
MonotonicityNot monotonic
2023-04-16T23:54:43.428056image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
7 61397
 
2.7%
4 60564
 
2.7%
3 58975
 
2.6%
6 57977
 
2.6%
8 57851
 
2.6%
5 57489
 
2.5%
9 53488
 
2.4%
13 51374
 
2.3%
10 51180
 
2.3%
2 50555
 
2.2%
Other values (395) 789894
34.9%
(Missing) 909924
40.3%
ValueCountFrequency (%)
0 653
 
< 0.1%
1 27716
1.2%
2 50555
2.2%
3 58975
2.6%
4 60564
2.7%
ValueCountFrequency (%)
511 1
< 0.1%
507 1
< 0.1%
505 1
< 0.1%
503 1
< 0.1%
488 1
< 0.1%

mths_since_recent_bc
Real number (ℝ)

Distinct546
Distinct (%)< 0.1%
Missing73412
Missing (%)3.2%
Infinite0
Infinite (%)0.0%
Mean24.84485081
Minimum0
Maximum661
Zeros13450
Zeros (%)0.6%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:43.527995image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2
Q16
median14
Q330
95-th percentile89
Maximum661
Range661
Interquartile range (IQR)24

Descriptive statistics

Standard deviation32.31925269
Coefficient of variation (CV)1.300843098
Kurtosis20.67203482
Mean24.84485081
Median Absolute Deviation (MAD)9
Skewness3.507711335
Sum54342049
Variance1044.534095
MonotonicityNot monotonic
2023-04-16T23:54:43.603707image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3 104912
 
4.6%
2 102518
 
4.5%
4 100529
 
4.4%
5 94879
 
4.2%
6 88025
 
3.9%
7 83796
 
3.7%
8 78467
 
3.5%
9 72912
 
3.2%
1 69999
 
3.1%
10 68865
 
3.0%
Other values (536) 1322354
58.5%
(Missing) 73412
 
3.2%
ValueCountFrequency (%)
0 13450
 
0.6%
1 69999
3.1%
2 102518
4.5%
3 104912
4.6%
4 100529
4.4%
ValueCountFrequency (%)
661 1
< 0.1%
656 1
< 0.1%
640 1
< 0.1%
639 1
< 0.1%
628 1
< 0.1%

mths_since_recent_bc_dlq
Real number (ℝ)

Distinct177
Distinct (%)< 0.1%
Missing1740967
Missing (%)77.0%
Infinite0
Infinite (%)0.0%
Mean39.30308966
Minimum0
Maximum202
Zeros796
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:43.714864image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile6
Q121
median37
Q357
95-th percentile77
Maximum202
Range202
Interquartile range (IQR)36

Descriptive statistics

Standard deviation22.61768864
Coefficient of variation (CV)0.575468464
Kurtosis-0.6228222102
Mean39.30308966
Median Absolute Deviation (MAD)18
Skewness0.3340775888
Sum20425855
Variance511.5598394
MonotonicityNot monotonic
2023-04-16T23:54:43.809591image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
26 8060
 
0.4%
28 8027
 
0.4%
25 7934
 
0.4%
35 7933
 
0.4%
45 7906
 
0.3%
22 7903
 
0.3%
30 7895
 
0.3%
44 7887
 
0.3%
32 7883
 
0.3%
19 7875
 
0.3%
Other values (167) 440398
 
19.5%
(Missing) 1740967
77.0%
ValueCountFrequency (%)
0 796
 
< 0.1%
1 2629
0.1%
2 2998
0.1%
3 4142
0.2%
4 4816
0.2%
ValueCountFrequency (%)
202 1
< 0.1%
195 1
< 0.1%
194 1
< 0.1%
190 1
< 0.1%
189 1
< 0.1%

mths_since_recent_inq
Real number (ℝ)

MISSING  ZEROS 

Distinct26
Distinct (%)< 0.1%
Missing295435
Missing (%)13.1%
Infinite0
Infinite (%)0.0%
Mean7.024194078
Minimum0
Maximum25
Zeros168927
Zeros (%)7.5%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:43.903895image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q12
median5
Q311
95-th percentile19
Maximum25
Range25
Interquartile range (IQR)9

Descriptive statistics

Standard deviation5.965411442
Coefficient of variation (CV)0.8492663181
Kurtosis-0.03946573685
Mean7.024194078
Median Absolute Deviation (MAD)4
Skewness0.8890161004
Sum13804178
Variance35.58613367
MonotonicityNot monotonic
2023-04-16T23:54:43.982817image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
1 212773
 
9.4%
2 173553
 
7.7%
0 168927
 
7.5%
3 157363
 
7.0%
4 143887
 
6.4%
5 127796
 
5.7%
6 114793
 
5.1%
7 108912
 
4.8%
8 96554
 
4.3%
9 85028
 
3.8%
Other values (16) 575647
25.5%
(Missing) 295435
13.1%
ValueCountFrequency (%)
0 168927
7.5%
1 212773
9.4%
2 173553
7.7%
3 157363
7.0%
4 143887
6.4%
ValueCountFrequency (%)
25 31
 
< 0.1%
24 9247
0.4%
23 18516
0.8%
22 20110
0.9%
21 22085
1.0%
Distinct179
Distinct (%)< 0.1%
Missing1520309
Missing (%)67.3%
Infinite0
Infinite (%)0.0%
Mean35.78222322
Minimum0
Maximum202
Zeros1321
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:44.072063image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile5
Q117
median33
Q351
95-th percentile76
Maximum202
Range202
Interquartile range (IQR)34

Descriptive statistics

Standard deviation22.30723894
Coefficient of variation (CV)0.6234167957
Kurtosis-0.486454921
Mean35.78222322
Median Absolute Deviation (MAD)17
Skewness0.4954874723
Sum26491691
Variance497.6129092
MonotonicityNot monotonic
2023-04-16T23:54:44.166753image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
12 13402
 
0.6%
13 13281
 
0.6%
14 12802
 
0.6%
15 12782
 
0.6%
19 12542
 
0.6%
16 12506
 
0.6%
18 12500
 
0.6%
9 12431
 
0.5%
21 12324
 
0.5%
22 12269
 
0.5%
Other values (169) 613520
27.1%
(Missing) 1520309
67.3%
ValueCountFrequency (%)
0 1321
 
0.1%
1 4821
0.2%
2 5792
0.3%
3 7817
0.3%
4 9073
0.4%
ValueCountFrequency (%)
202 1
< 0.1%
197 1
< 0.1%
190 1
< 0.1%
188 1
< 0.1%
183 1
< 0.1%

next_pymnt_d
Categorical

HIGH CARDINALITY  IMBALANCE 

Distinct106
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.2 MiB
1303607 
Mar-2019
953821 
Feb-2019
 
406
Mar-2011
 
107
Apr-2011
 
101
Other values (101)
 
2626

Unique

Unique4 ?
Unique (%)< 0.1%

Sample

1st rowMar-2019
2nd rowMar-2019
3rd rowMar-2019
4th rowMar-2019
5th rowMar-2019

Common Values

ValueCountFrequency (%)
1303607
57.7%
Mar-2019 953821
42.2%
Feb-2019 406
 
< 0.1%
Mar-2011 107
 
< 0.1%
Apr-2011 101
 
< 0.1%
Feb-2011 91
 
< 0.1%
Jan-2011 79
 
< 0.1%
Apr-2019 78
 
< 0.1%
May-2011 77
 
< 0.1%
Dec-2010 71
 
< 0.1%
Other values (96) 2230
 
0.1%

num_accts_ever_120_pd
Real number (ℝ)

MISSING  ZEROS 

Distinct44
Distinct (%)< 0.1%
Missing70276
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean0.5002081819
Minimum0
Maximum58
Zeros1687416
Zeros (%)74.6%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:44.261475image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile3
Maximum58
Range58
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1.350325678
Coefficient of variation (CV)2.69952737
Kurtosis52.39404559
Mean0.5002081819
Median Absolute Deviation (MAD)0
Skewness5.434771953
Sum1095652
Variance1.823379435
MonotonicityNot monotonic
2023-04-16T23:54:44.355238image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=44)
ValueCountFrequency (%)
0 1687416
74.6%
1 270277
 
12.0%
2 105854
 
4.7%
3 48574
 
2.1%
4 28665
 
1.3%
5 17014
 
0.8%
6 11075
 
0.5%
7 7023
 
0.3%
8 4555
 
0.2%
9 2975
 
0.1%
Other values (34) 6964
 
0.3%
(Missing) 70276
 
3.1%
ValueCountFrequency (%)
0 1687416
74.6%
1 270277
 
12.0%
2 105854
 
4.7%
3 48574
 
2.1%
4 28665
 
1.3%
ValueCountFrequency (%)
58 1
< 0.1%
51 1
< 0.1%
45 1
< 0.1%
42 2
< 0.1%
39 2
< 0.1%

num_actv_bc_tl
Real number (ℝ)

MISSING  ZEROS 

Distinct42
Distinct (%)< 0.1%
Missing70276
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean3.676068941
Minimum0
Maximum50
Zeros50061
Zeros (%)2.2%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:44.453112image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q12
median3
Q35
95-th percentile8
Maximum50
Range50
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.324646415
Coefficient of variation (CV)0.6323729105
Kurtosis4.647915181
Mean3.676068941
Median Absolute Deviation (MAD)1
Skewness1.476436002
Sum8052032
Variance5.403980957
MonotonicityNot monotonic
2023-04-16T23:54:44.532486image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=42)
ValueCountFrequency (%)
3 458887
20.3%
2 442726
19.6%
4 357154
15.8%
1 256818
11.4%
5 240636
10.6%
6 152238
 
6.7%
7 91653
 
4.1%
8 55107
 
2.4%
0 50061
 
2.2%
9 33163
 
1.5%
Other values (32) 51949
 
2.3%
(Missing) 70276
 
3.1%
ValueCountFrequency (%)
0 50061
 
2.2%
1 256818
11.4%
2 442726
19.6%
3 458887
20.3%
4 357154
15.8%
ValueCountFrequency (%)
50 1
< 0.1%
48 2
< 0.1%
47 1
< 0.1%
46 1
< 0.1%
45 1
< 0.1%

num_actv_rev_tl
Real number (ℝ)

Distinct57
Distinct (%)< 0.1%
Missing70276
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean5.629467693
Minimum0
Maximum72
Zeros11439
Zeros (%)0.5%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:44.638070image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2
Q13
median5
Q37
95-th percentile12
Maximum72
Range72
Interquartile range (IQR)4

Descriptive statistics

Standard deviation3.382873759
Coefficient of variation (CV)0.6009224927
Kurtosis4.996738255
Mean5.629467693
Median Absolute Deviation (MAD)2
Skewness1.573518347
Sum12330741
Variance11.44383487
MonotonicityNot monotonic
2023-04-16T23:54:45.194910image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4 333303
14.7%
3 308512
13.6%
5 303634
13.4%
6 248257
11.0%
2 213366
9.4%
7 191696
8.5%
8 141199
6.2%
9 102473
 
4.5%
1 84992
 
3.8%
10 72350
 
3.2%
Other values (47) 190610
8.4%
(Missing) 70276
 
3.1%
ValueCountFrequency (%)
0 11439
 
0.5%
1 84992
 
3.8%
2 213366
9.4%
3 308512
13.6%
4 333303
14.7%
ValueCountFrequency (%)
72 1
 
< 0.1%
63 1
 
< 0.1%
60 1
 
< 0.1%
59 3
< 0.1%
57 2
< 0.1%

num_bc_sats
Real number (ℝ)

MISSING  ZEROS 

Distinct60
Distinct (%)< 0.1%
Missing58590
Missing (%)2.6%
Infinite0
Infinite (%)0.0%
Mean4.774183294
Minimum0
Maximum71
Zeros23661
Zeros (%)1.0%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:45.291258image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q13
median4
Q36
95-th percentile10
Maximum71
Range71
Interquartile range (IQR)3

Descriptive statistics

Standard deviation3.037921424
Coefficient of variation (CV)0.636322746
Kurtosis6.767222463
Mean4.774183294
Median Absolute Deviation (MAD)2
Skewness1.749103457
Sum10513124
Variance9.228966576
MonotonicityNot monotonic
2023-04-16T23:54:45.384192image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3 380903
16.8%
4 359451
15.9%
2 304911
13.5%
5 289607
12.8%
6 215249
9.5%
7 151218
 
6.7%
1 148636
 
6.6%
8 103846
 
4.6%
9 70471
 
3.1%
10 47462
 
2.1%
Other values (50) 130324
 
5.8%
(Missing) 58590
 
2.6%
ValueCountFrequency (%)
0 23661
 
1.0%
1 148636
 
6.6%
2 304911
13.5%
3 380903
16.8%
4 359451
15.9%
ValueCountFrequency (%)
71 1
< 0.1%
69 1
< 0.1%
64 1
< 0.1%
63 1
< 0.1%
61 1
< 0.1%

num_bc_tl
Real number (ℝ)

Distinct76
Distinct (%)< 0.1%
Missing70276
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean7.726401941
Minimum0
Maximum86
Zeros5701
Zeros (%)0.3%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:45.495701image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2
Q14
median7
Q310
95-th percentile17
Maximum86
Range86
Interquartile range (IQR)6

Descriptive statistics

Standard deviation4.701430113
Coefficient of variation (CV)0.6084889382
Kurtosis3.905674874
Mean7.726401941
Median Absolute Deviation (MAD)3
Skewness1.425565032
Sum16923849
Variance22.10344511
MonotonicityNot monotonic
2023-04-16T23:54:45.590032image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5 231341
10.2%
6 222963
9.9%
4 222258
9.8%
7 203775
9.0%
3 183556
8.1%
8 180553
8.0%
9 152474
 
6.7%
10 126591
 
5.6%
2 120417
 
5.3%
11 103389
 
4.6%
Other values (66) 443075
19.6%
ValueCountFrequency (%)
0 5701
 
0.3%
1 48219
 
2.1%
2 120417
5.3%
3 183556
8.1%
4 222258
9.8%
ValueCountFrequency (%)
86 1
< 0.1%
85 1
< 0.1%
82 1
< 0.1%
79 1
< 0.1%
77 1
< 0.1%

num_il_tl
Real number (ℝ)

MISSING  ZEROS 

Distinct122
Distinct (%)< 0.1%
Missing70276
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean8.413438782
Minimum0
Maximum159
Zeros68944
Zeros (%)3.0%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:45.679595image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q13
median6
Q311
95-th percentile23
Maximum159
Range159
Interquartile range (IQR)8

Descriptive statistics

Standard deviation7.359113771
Coefficient of variation (CV)0.8746856026
Kurtosis7.841626515
Mean8.413438782
Median Absolute Deviation (MAD)3
Skewness2.103554166
Sum18428729
Variance54.15655549
MonotonicityNot monotonic
2023-04-16T23:54:45.764049image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4 190600
 
8.4%
3 190128
 
8.4%
5 180396
 
8.0%
2 173360
 
7.7%
6 165370
 
7.3%
7 147949
 
6.5%
1 133227
 
5.9%
8 129276
 
5.7%
9 111342
 
4.9%
10 96604
 
4.3%
Other values (112) 672140
29.7%
ValueCountFrequency (%)
0 68944
 
3.0%
1 133227
5.9%
2 173360
7.7%
3 190128
8.4%
4 190600
8.4%
ValueCountFrequency (%)
159 1
< 0.1%
150 1
< 0.1%
140 1
< 0.1%
138 1
< 0.1%
132 1
< 0.1%

num_op_rev_tl
Real number (ℝ)

Distinct81
Distinct (%)< 0.1%
Missing70276
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean8.24652254
Minimum0
Maximum91
Zeros1113
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:45.876846image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile3
Q15
median7
Q310
95-th percentile17
Maximum91
Range91
Interquartile range (IQR)5

Descriptive statistics

Standard deviation4.683927892
Coefficient of variation (CV)0.5679882483
Kurtosis4.732174134
Mean8.24652254
Median Absolute Deviation (MAD)3
Skewness1.526702877
Sum18063117
Variance21.9391805
MonotonicityNot monotonic
2023-04-16T23:54:45.972322image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
6 236912
10.5%
5 231071
10.2%
7 222688
9.9%
4 201987
8.9%
8 198673
8.8%
9 169770
 
7.5%
3 145954
 
6.5%
10 141418
 
6.3%
11 114837
 
5.1%
12 92236
 
4.1%
Other values (71) 434846
19.2%
ValueCountFrequency (%)
0 1113
 
< 0.1%
1 18563
 
0.8%
2 77763
 
3.4%
3 145954
6.5%
4 201987
8.9%
ValueCountFrequency (%)
91 3
< 0.1%
86 1
 
< 0.1%
83 1
 
< 0.1%
81 1
 
< 0.1%
79 1
 
< 0.1%

num_rev_accts
Real number (ℝ)

Distinct117
Distinct (%)< 0.1%
Missing70277
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean14.00462977
Minimum0
Maximum151
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:46.078296image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile4
Q18
median12
Q318
95-th percentile29
Maximum151
Range151
Interquartile range (IQR)10

Descriptive statistics

Standard deviation8.038867538
Coefficient of variation (CV)0.5740149987
Kurtosis3.460986383
Mean14.00462977
Median Absolute Deviation (MAD)5
Skewness1.369541362
Sum30675615
Variance64.62339129
MonotonicityNot monotonic
2023-04-16T23:54:46.171021image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10 132715
 
5.9%
9 132157
 
5.8%
8 129674
 
5.7%
11 129189
 
5.7%
12 123608
 
5.5%
7 122058
 
5.4%
13 116460
 
5.2%
6 109695
 
4.9%
14 107620
 
4.8%
15 99544
 
4.4%
Other values (107) 987671
43.7%
ValueCountFrequency (%)
0 1
 
< 0.1%
1 19
 
< 0.1%
2 21211
 
0.9%
3 44334
2.0%
4 69400
3.1%
ValueCountFrequency (%)
151 1
< 0.1%
143 1
< 0.1%
128 1
< 0.1%
127 2
< 0.1%
119 1
< 0.1%

num_rev_tl_bal_gt_0
Real number (ℝ)

Distinct50
Distinct (%)< 0.1%
Missing70276
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean5.577950887
Minimum0
Maximum65
Zeros11252
Zeros (%)0.5%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:46.267663image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2
Q13
median5
Q37
95-th percentile12
Maximum65
Range65
Interquartile range (IQR)4

Descriptive statistics

Standard deviation3.293433856
Coefficient of variation (CV)0.5904379444
Kurtosis4.173233011
Mean5.577950887
Median Absolute Deviation (MAD)2
Skewness1.470687356
Sum12217899
Variance10.84670656
MonotonicityNot monotonic
2023-04-16T23:54:46.362482image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4 335837
14.9%
3 310629
13.7%
5 306205
13.5%
6 249901
11.1%
2 214144
9.5%
7 192557
8.5%
8 141264
6.2%
9 102127
 
4.5%
1 84778
 
3.8%
10 71574
 
3.2%
Other values (40) 181376
8.0%
(Missing) 70276
 
3.1%
ValueCountFrequency (%)
0 11252
 
0.5%
1 84778
 
3.8%
2 214144
9.5%
3 310629
13.7%
4 335837
14.9%
ValueCountFrequency (%)
65 1
 
< 0.1%
59 2
< 0.1%
55 1
 
< 0.1%
47 1
 
< 0.1%
45 3
< 0.1%

num_sats
Real number (ℝ)

Distinct91
Distinct (%)< 0.1%
Missing58590
Missing (%)2.6%
Infinite0
Infinite (%)0.0%
Mean11.62812988
Minimum0
Maximum101
Zeros61
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:46.457098image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile5
Q18
median11
Q314
95-th percentile22
Maximum101
Range101
Interquartile range (IQR)6

Descriptive statistics

Standard deviation5.644026548
Coefficient of variation (CV)0.4853769784
Kurtosis3.421898901
Mean11.62812988
Median Absolute Deviation (MAD)3
Skewness1.314627806
Sum25606049
Variance31.85503568
MonotonicityNot monotonic
2023-04-16T23:54:46.553822image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9 190842
 
8.4%
10 185016
 
8.2%
8 183966
 
8.1%
11 170664
 
7.5%
7 168032
 
7.4%
12 153317
 
6.8%
6 141290
 
6.2%
13 133971
 
5.9%
14 115266
 
5.1%
5 105440
 
4.7%
Other values (81) 654274
28.9%
ValueCountFrequency (%)
0 61
 
< 0.1%
1 1634
 
0.1%
2 10216
 
0.5%
3 30948
1.4%
4 65577
2.9%
ValueCountFrequency (%)
101 1
< 0.1%
97 1
< 0.1%
94 1
< 0.1%
93 1
< 0.1%
91 1
< 0.1%

num_tl_120dpd_2m
Real number (ℝ)

MISSING  SKEWED  ZEROS 

Distinct7
Distinct (%)< 0.1%
Missing153657
Missing (%)6.8%
Infinite0
Infinite (%)0.0%
Mean0.0006373958181
Minimum0
Maximum7
Zeros2105738
Zeros (%)93.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:46.641662image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum7
Range7
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.02710643324
Coefficient of variation (CV)42.52684513
Kurtosis5541.593233
Mean0.0006373958181
Median Absolute Deviation (MAD)0
Skewness55.80984712
Sum1343
Variance0.0007347587232
MonotonicityNot monotonic
2023-04-16T23:54:46.701580image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
0 2105738
93.1%
1 1219
 
0.1%
2 46
 
< 0.1%
3 5
 
< 0.1%
4 1
 
< 0.1%
6 1
 
< 0.1%
7 1
 
< 0.1%
(Missing) 153657
 
6.8%
ValueCountFrequency (%)
0 2105738
93.1%
1 1219
 
0.1%
2 46
 
< 0.1%
3 5
 
< 0.1%
4 1
 
< 0.1%
ValueCountFrequency (%)
7 1
 
< 0.1%
6 1
 
< 0.1%
4 1
 
< 0.1%
3 5
 
< 0.1%
2 46
< 0.1%

num_tl_30dpd
Real number (ℝ)

MISSING  SKEWED  ZEROS 

Distinct5
Distinct (%)< 0.1%
Missing70276
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean0.00281365162
Minimum0
Maximum4
Zeros2184561
Zeros (%)96.6%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:46.771791image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum4
Range4
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.05616522447
Coefficient of variation (CV)19.96168398
Kurtosis622.4309597
Mean0.00281365162
Median Absolute Deviation (MAD)0
Skewness22.51746312
Sum6163
Variance0.00315453244
MonotonicityNot monotonic
2023-04-16T23:54:46.836784image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=5)
ValueCountFrequency (%)
0 2184561
96.6%
1 5542
 
0.2%
2 253
 
< 0.1%
3 29
 
< 0.1%
4 7
 
< 0.1%
(Missing) 70276
 
3.1%
ValueCountFrequency (%)
0 2184561
96.6%
1 5542
 
0.2%
2 253
 
< 0.1%
3 29
 
< 0.1%
4 7
 
< 0.1%
ValueCountFrequency (%)
4 7
 
< 0.1%
3 29
 
< 0.1%
2 253
 
< 0.1%
1 5542
 
0.2%
0 2184561
96.6%

num_tl_90g_dpd_24m
Real number (ℝ)

MISSING  ZEROS 

Distinct34
Distinct (%)< 0.1%
Missing70276
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean0.08293766595
Minimum0
Maximum58
Zeros2073060
Zeros (%)91.7%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:46.923560image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum58
Range58
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.4935732136
Coefficient of variation (CV)5.951134602
Kurtosis459.1979059
Mean0.08293766595
Median Absolute Deviation (MAD)0
Skewness14.90157353
Sum181666
Variance0.2436145172
MonotonicityNot monotonic
2023-04-16T23:54:47.002801image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=34)
ValueCountFrequency (%)
0 2073060
91.7%
1 88450
 
3.9%
2 16897
 
0.7%
3 4672
 
0.2%
4 2653
 
0.1%
5 1398
 
0.1%
6 1018
 
< 0.1%
7 604
 
< 0.1%
8 467
 
< 0.1%
9 340
 
< 0.1%
Other values (24) 833
 
< 0.1%
(Missing) 70276
 
3.1%
ValueCountFrequency (%)
0 2073060
91.7%
1 88450
 
3.9%
2 16897
 
0.7%
3 4672
 
0.2%
4 2653
 
0.1%
ValueCountFrequency (%)
58 1
< 0.1%
42 1
< 0.1%
39 1
< 0.1%
36 1
< 0.1%
35 1
< 0.1%

num_tl_op_past_12m
Real number (ℝ)

MISSING  ZEROS 

Distinct33
Distinct (%)< 0.1%
Missing70276
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean2.076755211
Minimum0
Maximum32
Zeros415975
Zeros (%)18.4%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:47.083173image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median2
Q33
95-th percentile5
Maximum32
Range32
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.830710765
Coefficient of variation (CV)0.8815245798
Kurtosis4.69604436
Mean2.076755211
Median Absolute Deviation (MAD)1
Skewness1.503461003
Sum4548908
Variance3.351501904
MonotonicityNot monotonic
2023-04-16T23:54:47.163593image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=33)
ValueCountFrequency (%)
1 558928
24.7%
2 483541
21.4%
0 415975
18.4%
3 335448
14.8%
4 195095
 
8.6%
5 97421
 
4.3%
6 48518
 
2.1%
7 25934
 
1.1%
8 13066
 
0.6%
9 7131
 
0.3%
Other values (23) 9335
 
0.4%
(Missing) 70276
 
3.1%
ValueCountFrequency (%)
0 415975
18.4%
1 558928
24.7%
2 483541
21.4%
3 335448
14.8%
4 195095
 
8.6%
ValueCountFrequency (%)
32 1
 
< 0.1%
31 1
 
< 0.1%
30 2
< 0.1%
29 1
 
< 0.1%
28 4
< 0.1%

open_acc
Real number (ℝ)

Distinct91
Distinct (%)< 0.1%
Missing29
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean11.61240207
Minimum0
Maximum101
Zeros56
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:47.268065image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile5
Q18
median11
Q314
95-th percentile22
Maximum101
Range101
Interquartile range (IQR)6

Descriptive statistics

Standard deviation5.640861338
Coefficient of variation (CV)0.4857618006
Kurtosis3.446376782
Mean11.61240207
Median Absolute Deviation (MAD)3
Skewness1.315544951
Sum26251449
Variance31.81931664
MonotonicityNot monotonic
2023-04-16T23:54:47.353119image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9 195762
 
8.7%
10 189737
 
8.4%
8 188717
 
8.3%
11 175101
 
7.7%
7 172834
 
7.6%
12 157331
 
7.0%
6 145444
 
6.4%
13 137502
 
6.1%
14 118314
 
5.2%
5 108565
 
4.8%
Other values (81) 671332
29.7%
ValueCountFrequency (%)
0 56
 
< 0.1%
1 1644
 
0.1%
2 10860
 
0.5%
3 32428
1.4%
4 67827
3.0%
ValueCountFrequency (%)
101 1
< 0.1%
97 1
< 0.1%
94 1
< 0.1%
93 1
< 0.1%
91 1
< 0.1%

open_acc_6m
Real number (ℝ)

MISSING  ZEROS 

Distinct19
Distinct (%)< 0.1%
Missing866130
Missing (%)38.3%
Infinite0
Infinite (%)0.0%
Mean0.934419858
Minimum0
Maximum18
Zeros627966
Zeros (%)27.8%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:47.450001image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q31
95-th percentile3
Maximum18
Range18
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.140699932
Coefficient of variation (CV)1.220757374
Kurtosis4.480970574
Mean0.934419858
Median Absolute Deviation (MAD)1
Skewness1.681378599
Sum1303084
Variance1.301196335
MonotonicityNot monotonic
2023-04-16T23:54:47.523967image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
0 627966
27.8%
1 434092
19.2%
2 204167
 
9.0%
3 81147
 
3.6%
4 29906
 
1.3%
5 10643
 
0.5%
6 3956
 
0.2%
7 1556
 
0.1%
8 625
 
< 0.1%
9 259
 
< 0.1%
Other values (9) 221
 
< 0.1%
(Missing) 866130
38.3%
ValueCountFrequency (%)
0 627966
27.8%
1 434092
19.2%
2 204167
 
9.0%
3 81147
 
3.6%
4 29906
 
1.3%
ValueCountFrequency (%)
18 1
 
< 0.1%
17 1
 
< 0.1%
16 2
 
< 0.1%
15 2
 
< 0.1%
14 9
< 0.1%

open_act_il
Real number (ℝ)

MISSING  ZEROS 

Distinct54
Distinct (%)< 0.1%
Missing866129
Missing (%)38.3%
Infinite0
Infinite (%)0.0%
Mean2.779406671
Minimum0
Maximum57
Zeros165848
Zeros (%)7.3%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:47.612648image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median2
Q33
95-th percentile9
Maximum57
Range57
Interquartile range (IQR)2

Descriptive statistics

Standard deviation3.000784358
Coefficient of variation (CV)1.079649261
Kurtosis13.71859618
Mean2.779406671
Median Absolute Deviation (MAD)1
Skewness2.976152472
Sum3875991
Variance9.004706761
MonotonicityNot monotonic
2023-04-16T23:54:47.707452image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 356302
15.8%
2 334086
 
14.8%
3 209677
 
9.3%
0 165848
 
7.3%
4 112592
 
5.0%
5 62357
 
2.8%
6 37624
 
1.7%
7 25961
 
1.1%
8 19050
 
0.8%
9 14849
 
0.7%
Other values (44) 56193
 
2.5%
(Missing) 866129
38.3%
ValueCountFrequency (%)
0 165848
7.3%
1 356302
15.8%
2 334086
14.8%
3 209677
9.3%
4 112592
 
5.0%
ValueCountFrequency (%)
57 1
< 0.1%
56 1
< 0.1%
55 1
< 0.1%
53 1
< 0.1%
49 2
< 0.1%

open_il_12m
Real number (ℝ)

MISSING  ZEROS 

Distinct19
Distinct (%)< 0.1%
Missing866129
Missing (%)38.3%
Infinite0
Infinite (%)0.0%
Mean0.6764314229
Minimum0
Maximum25
Zeros760254
Zeros (%)33.6%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:47.785527image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile2
Maximum25
Range25
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.925635427
Coefficient of variation (CV)1.368409858
Kurtosis5.428570249
Mean0.6764314229
Median Absolute Deviation (MAD)0
Skewness1.791122671
Sum943310
Variance0.8568009438
MonotonicityNot monotonic
2023-04-16T23:54:47.861457image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
0 760254
33.6%
1 418123
18.5%
2 152164
 
6.7%
3 44170
 
2.0%
4 13608
 
0.6%
5 4309
 
0.2%
6 1484
 
0.1%
7 219
 
< 0.1%
8 97
 
< 0.1%
9 49
 
< 0.1%
Other values (9) 62
 
< 0.1%
(Missing) 866129
38.3%
ValueCountFrequency (%)
0 760254
33.6%
1 418123
18.5%
2 152164
 
6.7%
3 44170
 
2.0%
4 13608
 
0.6%
ValueCountFrequency (%)
25 1
< 0.1%
21 1
< 0.1%
20 2
< 0.1%
15 1
< 0.1%
14 1
< 0.1%

open_il_24m
Real number (ℝ)

MISSING  ZEROS 

Distinct31
Distinct (%)< 0.1%
Missing866129
Missing (%)38.3%
Infinite0
Infinite (%)0.0%
Mean1.562751562
Minimum0
Maximum51
Zeros377489
Zeros (%)16.7%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:47.940638image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q32
95-th percentile5
Maximum51
Range51
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.578672095
Coefficient of variation (CV)1.010187501
Kurtosis6.489226867
Mean1.562751562
Median Absolute Deviation (MAD)1
Skewness1.760411311
Sum2179318
Variance2.492205585
MonotonicityNot monotonic
2023-04-16T23:54:48.034739image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=31)
ValueCountFrequency (%)
1 439811
19.5%
0 377489
16.7%
2 284333
 
12.6%
3 148127
 
6.6%
4 72627
 
3.2%
5 36227
 
1.6%
6 17577
 
0.8%
7 8851
 
0.4%
8 4506
 
0.2%
9 2262
 
0.1%
Other values (21) 2729
 
0.1%
(Missing) 866129
38.3%
ValueCountFrequency (%)
0 377489
16.7%
1 439811
19.5%
2 284333
12.6%
3 148127
 
6.6%
4 72627
 
3.2%
ValueCountFrequency (%)
51 1
< 0.1%
39 1
< 0.1%
31 1
< 0.1%
30 1
< 0.1%
28 1
< 0.1%

open_rv_12m
Real number (ℝ)

MISSING  ZEROS 

Distinct29
Distinct (%)< 0.1%
Missing866129
Missing (%)38.3%
Infinite0
Infinite (%)0.0%
Mean1.290133155
Minimum0
Maximum28
Zeros513716
Zeros (%)22.7%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:48.120724image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q32
95-th percentile4
Maximum28
Range28
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.506826647
Coefficient of variation (CV)1.167962114
Kurtosis7.794958019
Mean1.290133155
Median Absolute Deviation (MAD)1
Skewness2.019059937
Sum1799141
Variance2.270526544
MonotonicityNot monotonic
2023-04-16T23:54:48.197996image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=29)
ValueCountFrequency (%)
0 513716
22.7%
1 413897
18.3%
2 238188
 
10.5%
3 118920
 
5.3%
4 56137
 
2.5%
5 26685
 
1.2%
6 13085
 
0.6%
7 6466
 
0.3%
8 3218
 
0.1%
9 1778
 
0.1%
Other values (19) 2449
 
0.1%
(Missing) 866129
38.3%
ValueCountFrequency (%)
0 513716
22.7%
1 413897
18.3%
2 238188
10.5%
3 118920
 
5.3%
4 56137
 
2.5%
ValueCountFrequency (%)
28 2
 
< 0.1%
27 1
 
< 0.1%
26 5
< 0.1%
25 1
 
< 0.1%
24 2
 
< 0.1%

open_rv_24m
Real number (ℝ)

MISSING  ZEROS 

Distinct50
Distinct (%)< 0.1%
Missing866129
Missing (%)38.3%
Infinite0
Infinite (%)0.0%
Mean2.749923093
Minimum0
Maximum60
Zeros223783
Zeros (%)9.9%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:48.299128image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median2
Q34
95-th percentile8
Maximum60
Range60
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.596910679
Coefficient of variation (CV)0.9443575663
Kurtosis8.208281738
Mean2.749923093
Median Absolute Deviation (MAD)1
Skewness1.977657369
Sum3834875
Variance6.743945077
MonotonicityNot monotonic
2023-04-16T23:54:48.387806image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 297887
 
13.2%
2 267192
 
11.8%
0 223783
 
9.9%
3 200886
 
8.9%
4 139066
 
6.2%
5 92969
 
4.1%
6 60078
 
2.7%
7 39084
 
1.7%
8 24997
 
1.1%
9 16162
 
0.7%
Other values (40) 32435
 
1.4%
(Missing) 866129
38.3%
ValueCountFrequency (%)
0 223783
9.9%
1 297887
13.2%
2 267192
11.8%
3 200886
8.9%
4 139066
6.2%
ValueCountFrequency (%)
60 1
< 0.1%
54 1
< 0.1%
53 1
< 0.1%
50 2
< 0.1%
49 1
< 0.1%
Distinct7313
Distinct (%)86.8%
Missing2252242
Missing (%)99.6%
Infinite0
Infinite (%)0.0%
Mean454.8408023
Minimum1.92
Maximum2680.89
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:48.482603image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum1.92
5-th percentile62.1075
Q1174.9675
median352.605
Q3622.7925
95-th percentile1188.945
Maximum2680.89
Range2678.97
Interquartile range (IQR)447.825

Descriptive statistics

Standard deviation375.8307374
Coefficient of variation (CV)0.8262907274
Kurtosis3.32701935
Mean454.8408023
Median Absolute Deviation (MAD)202.665
Skewness1.599409071
Sum3832488.6
Variance141248.7432
MonotonicityNot monotonic
2023-04-16T23:54:48.580129image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
283.77 5
 
< 0.1%
396.99 5
 
< 0.1%
200.37 4
 
< 0.1%
186.9 4
 
< 0.1%
449.13 4
 
< 0.1%
163.65 4
 
< 0.1%
154.77 4
 
< 0.1%
226.08 4
 
< 0.1%
145.68 4
 
< 0.1%
268.14 4
 
< 0.1%
Other values (7303) 8384
 
0.4%
(Missing) 2252242
99.6%
ValueCountFrequency (%)
1.92 1
< 0.1%
4.41 1
< 0.1%
6.06 1
< 0.1%
6.45 1
< 0.1%
10.17 1
< 0.1%
ValueCountFrequency (%)
2680.89 1
< 0.1%
2679.15 1
< 0.1%
2535.66 1
< 0.1%
2513.04 1
< 0.1%
2486.94 1
< 0.1%

out_prncp
Real number (ℝ)

Distinct364399
Distinct (%)16.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4446.292883
Minimum0
Maximum40000
Zeros1312200
Zeros (%)58.0%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:48.672641image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q36712.6325
95-th percentile21732.94
Maximum40000
Range40000
Interquartile range (IQR)6712.6325

Descriptive statistics

Standard deviation7547.611729
Coefficient of variation (CV)1.697506648
Kurtosis3.745122988
Mean4446.292883
Median Absolute Deviation (MAD)0
Skewness2.010716097
Sum1.005159204 × 1010
Variance56966442.81
MonotonicityNot monotonic
2023-04-16T23:54:48.783261image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1312200
58.0%
8717.99 302
 
< 0.1%
9009.59 285
 
< 0.1%
8457.66 284
 
< 0.1%
9493.69 277
 
< 0.1%
8977.01 276
 
< 0.1%
9510.8 259
 
< 0.1%
9001.08 231
 
< 0.1%
9051.41 231
 
< 0.1%
8503.85 230
 
< 0.1%
Other values (364389) 946093
41.9%
ValueCountFrequency (%)
0 1312200
58.0%
0.01 3
 
< 0.1%
0.02 4
 
< 0.1%
0.03 3
 
< 0.1%
0.04 5
 
< 0.1%
ValueCountFrequency (%)
40000 3
< 0.1%
39982.36 1
 
< 0.1%
39800 1
 
< 0.1%
39662.62 1
 
< 0.1%
39616.66 1
 
< 0.1%

out_prncp_inv
Real number (ℝ)

Distinct377353
Distinct (%)16.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4445.294869
Minimum0
Maximum40000
Zeros1312200
Zeros (%)58.0%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:48.879209image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q36710.32
95-th percentile21729.82
Maximum40000
Range40000
Interquartile range (IQR)6710.32

Descriptive statistics

Standard deviation7546.656886
Coefficient of variation (CV)1.697672957
Kurtosis3.747150401
Mean4445.294869
Median Absolute Deviation (MAD)0
Skewness2.011121433
Sum1.004933586 × 1010
Variance56952030.16
MonotonicityNot monotonic
2023-04-16T23:54:48.972609image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1312200
58.0%
8717.99 300
 
< 0.1%
8457.66 282
 
< 0.1%
9009.59 280
 
< 0.1%
9493.69 275
 
< 0.1%
8977.01 272
 
< 0.1%
9510.8 254
 
< 0.1%
8503.85 230
 
< 0.1%
9001.08 227
 
< 0.1%
9051.41 225
 
< 0.1%
Other values (377343) 946123
41.9%
ValueCountFrequency (%)
0 1312200
58.0%
0.01 3
 
< 0.1%
0.02 4
 
< 0.1%
0.03 3
 
< 0.1%
0.04 5
 
< 0.1%
ValueCountFrequency (%)
40000 3
< 0.1%
39982.36 1
 
< 0.1%
39800 1
 
< 0.1%
39662.62 1
 
< 0.1%
39616.66 1
 
< 0.1%

payment_plan_start_date
Categorical

IMBALANCE  MISSING 

Distinct27
Distinct (%)< 0.1%
Missing95321
Missing (%)4.2%
Memory size17.2 MiB
2154734 
Sep-2017
 
1715
Oct-2017
 
1629
Nov-2017
 
640
Oct-2018
 
538
Other values (22)
 
6091

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
2154734
95.3%
Sep-2017 1715
 
0.1%
Oct-2017 1629
 
0.1%
Nov-2017 640
 
< 0.1%
Oct-2018 538
 
< 0.1%
Nov-2018 481
 
< 0.1%
Aug-2018 456
 
< 0.1%
Sep-2018 416
 
< 0.1%
Dec-2017 413
 
< 0.1%
Jun-2017 394
 
< 0.1%
Other values (17) 3931
 
0.2%
(Missing) 95321
 
4.2%

pct_tl_nvr_dlq
Real number (ℝ)

Distinct690
Distinct (%)< 0.1%
Missing70431
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean94.11457646
Minimum0
Maximum100
Zeros13
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:49.084816image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile75
Q191.3
median100
Q3100
95-th percentile100
Maximum100
Range100
Interquartile range (IQR)8.7

Descriptive statistics

Standard deviation9.036140361
Coefficient of variation (CV)0.09601212374
Kurtosis6.854075654
Mean94.11457646
Median Absolute Deviation (MAD)0
Skewness-2.277944445
Sum206133227.6
Variance81.65183263
MonotonicityNot monotonic
2023-04-16T23:54:49.186797image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
100 1106209
48.9%
90 29542
 
1.3%
95 27840
 
1.2%
96 24023
 
1.1%
91.7 23278
 
1.0%
90.9 23274
 
1.0%
92.3 22822
 
1.0%
88.9 22667
 
1.0%
87.5 22258
 
1.0%
92.9 22101
 
1.0%
Other values (680) 866223
38.3%
(Missing) 70431
 
3.1%
ValueCountFrequency (%)
0 13
< 0.1%
5 1
 
< 0.1%
5.9 1
 
< 0.1%
6.7 1
 
< 0.1%
7.1 2
 
< 0.1%
ValueCountFrequency (%)
100 1106209
48.9%
99.4 2
 
< 0.1%
99.3 1
 
< 0.1%
99.2 9
 
< 0.1%
99.1 15
 
< 0.1%

percent_bc_gt_75
Real number (ℝ)

MISSING  ZEROS 

Distinct284
Distinct (%)< 0.1%
Missing75379
Missing (%)3.3%
Infinite0
Infinite (%)0.0%
Mean42.43512654
Minimum0
Maximum100
Zeros598711
Zeros (%)26.5%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:49.277447image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median37.5
Q371.4
95-th percentile100
Maximum100
Range100
Interquartile range (IQR)71.4

Descriptive statistics

Standard deviation36.21615733
Coefficient of variation (CV)0.85344761
Kurtosis-1.25565792
Mean42.43512654
Median Absolute Deviation (MAD)37.5
Skewness0.3091997107
Sum92733015.24
Variance1311.610051
MonotonicityNot monotonic
2023-04-16T23:54:49.370597image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 598711
26.5%
100 372700
16.5%
50 231935
 
10.3%
33.3 147284
 
6.5%
66.7 134722
 
6.0%
25 97801
 
4.3%
75 76405
 
3.4%
20 64820
 
2.9%
40 55574
 
2.5%
60 47850
 
2.1%
Other values (274) 357487
15.8%
(Missing) 75379
 
3.3%
ValueCountFrequency (%)
0 598711
26.5%
0.14 2
 
< 0.1%
0.17 1
 
< 0.1%
0.2 10
 
< 0.1%
0.25 19
 
< 0.1%
ValueCountFrequency (%)
100 372700
16.5%
95.8 1
 
< 0.1%
95.5 2
 
< 0.1%
95.2 2
 
< 0.1%
95 3
 
< 0.1%
Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.2 MiB
True
2260668 
ValueCountFrequency (%)
True 2260668
100.0%
2023-04-16T23:54:49.449756image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

pub_rec
Real number (ℝ)

Distinct43
Distinct (%)< 0.1%
Missing29
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean0.1975277787
Minimum0
Maximum86
Zeros1902758
Zeros (%)84.2%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:49.528443image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum86
Range86
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.5705150143
Coefficient of variation (CV)2.888277377
Kurtosis704.1159105
Mean0.1975277787
Median Absolute Deviation (MAD)0
Skewness11.37680843
Sum446539
Variance0.3254873815
MonotonicityNot monotonic
2023-04-16T23:54:49.608847image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=43)
ValueCountFrequency (%)
0 1902758
84.2%
1 305390
 
13.5%
2 34154
 
1.5%
3 10567
 
0.5%
4 3872
 
0.2%
5 1843
 
0.1%
6 933
 
< 0.1%
7 427
 
< 0.1%
8 243
 
< 0.1%
9 143
 
< 0.1%
Other values (33) 309
 
< 0.1%
ValueCountFrequency (%)
0 1902758
84.2%
1 305390
 
13.5%
2 34154
 
1.5%
3 10567
 
0.5%
4 3872
 
0.2%
ValueCountFrequency (%)
86 1
< 0.1%
63 1
< 0.1%
61 2
< 0.1%
54 1
< 0.1%
52 1
< 0.1%

pub_rec_bankruptcies
Real number (ℝ)

Distinct12
Distinct (%)< 0.1%
Missing1365
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean0.1281935181
Minimum0
Maximum12
Zeros1987383
Zeros (%)87.9%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:49.690851image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum12
Range12
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.3646129975
Coefficient of variation (CV)2.844238951
Kurtosis18.65787015
Mean0.1281935181
Median Absolute Deviation (MAD)0
Skewness3.37118635
Sum289628
Variance0.1329426379
MonotonicityNot monotonic
2023-04-16T23:54:49.750149image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
0 1987383
87.9%
1 258444
 
11.4%
2 10518
 
0.5%
3 2131
 
0.1%
4 541
 
< 0.1%
5 188
 
< 0.1%
6 60
 
< 0.1%
7 23
 
< 0.1%
8 10
 
< 0.1%
9 3
 
< 0.1%
Other values (2) 2
 
< 0.1%
(Missing) 1365
 
0.1%
ValueCountFrequency (%)
0 1987383
87.9%
1 258444
 
11.4%
2 10518
 
0.5%
3 2131
 
0.1%
4 541
 
< 0.1%
ValueCountFrequency (%)
12 1
 
< 0.1%
11 1
 
< 0.1%
9 3
 
< 0.1%
8 10
< 0.1%
7 23
< 0.1%

purpose
Categorical

Distinct14
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.2 MiB
debt_consolidation
1277877 
credit_card
516971 
home_improvement
150457 
other
139440 
major_purchase
 
50445
Other values (9)
 
125478

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowdebt_consolidation
2nd rowdebt_consolidation
3rd rowdebt_consolidation
4th rowdebt_consolidation
5th rowdebt_consolidation

Common Values

ValueCountFrequency (%)
debt_consolidation 1277877
56.5%
credit_card 516971
22.9%
home_improvement 150457
 
6.7%
other 139440
 
6.2%
major_purchase 50445
 
2.2%
medical 27488
 
1.2%
small_business 24689
 
1.1%
car 24013
 
1.1%
vacation 15525
 
0.7%
moving 15403
 
0.7%
Other values (4) 18360
 
0.8%

pymnt_plan
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.2 MiB
n
2259986 
y
 
682

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rown
2nd rown
3rd rown
4th rown
5th rown

Common Values

ValueCountFrequency (%)
n 2259986
> 99.9%
y 682
 
< 0.1%

Common Values (Plot)

2023-04-16T23:54:49.829283image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

recoveries
Real number (ℝ)

Distinct127920
Distinct (%)5.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean136.0739977
Minimum0
Maximum39859.55
Zeros2083167
Zeros (%)92.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:49.908428image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile804.91
Maximum39859.55
Range39859.55
Interquartile range (IQR)0

Descriptive statistics

Standard deviation725.8316778
Coefficient of variation (CV)5.334095347
Kurtosis213.5673225
Mean136.0739977
Median Absolute Deviation (MAD)0
Skewness10.99948243
Sum307618132.2
Variance526831.6246
MonotonicityNot monotonic
2023-04-16T23:54:50.484737image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 2083167
92.1%
100 820
 
< 0.1%
50 787
 
< 0.1%
150 615
 
< 0.1%
200 470
 
< 0.1%
25 350
 
< 0.1%
75 305
 
< 0.1%
250 299
 
< 0.1%
300 294
 
< 0.1%
400 209
 
< 0.1%
Other values (127910) 173352
 
7.7%
ValueCountFrequency (%)
0 2083167
92.1%
0.01 14
 
< 0.1%
0.02 16
 
< 0.1%
0.03 25
 
< 0.1%
0.04 39
 
< 0.1%
ValueCountFrequency (%)
39859.55 1
< 0.1%
39444.37 1
< 0.1%
37153.46 1
< 0.1%
36578.54 1
< 0.1%
35581.88 1
< 0.1%

revol_bal
Real number (ℝ)

Distinct102251
Distinct (%)4.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16658.45808
Minimum0
Maximum2904836
Zeros12562
Zeros (%)0.6%
Negative0
Negative (%)0.0%
Memory size8.6 MiB
2023-04-16T23:54:50.593049image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1572
Q15950
median11324
Q320246
95-th percentile45164
Maximum2904836
Range2904836
Interquartile range (IQR)14296

Descriptive statistics

Standard deviation22948.30503
Coefficient of variation (CV)1.377576779
Kurtosis643.1980554
Mean16658.45808
Median Absolute Deviation (MAD)6381
Skewness13.23198843
Sum3.765924311 × 1010
Variance526624703.6
MonotonicityNot monotonic
2023-04-16T23:54:50.688877image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 12562
 
0.6%
8 216
 
< 0.1%
10 170
 
< 0.1%
2 169
 
< 0.1%
5 160
 
< 0.1%
5235 160
 
< 0.1%
6312 158
 
< 0.1%
5849 158
 
< 0.1%
5265 156
 
< 0.1%
6118 156
 
< 0.1%
Other values (102241) 2246603
99.4%
ValueCountFrequency (%)
0 12562
0.6%
1 123
 
< 0.1%
2 169
 
< 0.1%
3 151
 
< 0.1%
4 153
 
< 0.1%
ValueCountFrequency (%)
2904836 1
< 0.1%
2568995 1
< 0.1%
2560703 1
< 0.1%
2559552 1
< 0.1%
2358150 1
< 0.1%

revol_bal_joint
Real number (ℝ)

Distinct56875
Distinct (%)52.7%
Missing2152648
Missing (%)95.2%
Infinite0
Infinite (%)0.0%
Mean33617.27885
Minimum0
Maximum1110019
Zeros106
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:50.784832image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile5358
Q115106.75
median26516.5
Q343769
95-th percentile85002.05
Maximum1110019
Range1110019
Interquartile range (IQR)28662.25

Descriptive statistics

Standard deviation28153.87431
Coefficient of variation (CV)0.8374822495
Kurtosis32.57097407
Mean33617.27885
Median Absolute Deviation (MAD)13296
Skewness3.024207091
Sum3631338461
Variance792640638.6
MonotonicityNot monotonic
2023-04-16T23:54:50.878894image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 106
 
< 0.1%
20804 10
 
< 0.1%
13274 9
 
< 0.1%
11771 9
 
< 0.1%
26505 9
 
< 0.1%
22263 9
 
< 0.1%
12149 9
 
< 0.1%
19973 9
 
< 0.1%
20811 9
 
< 0.1%
10041 9
 
< 0.1%
Other values (56865) 107832
 
4.8%
(Missing) 2152648
95.2%
ValueCountFrequency (%)
0 106
< 0.1%
2 1
 
< 0.1%
3 1
 
< 0.1%
4 1
 
< 0.1%
5 1
 
< 0.1%
ValueCountFrequency (%)
1110019 1
< 0.1%
517755 1
< 0.1%
476826 1
< 0.1%
426860 1
< 0.1%
412216 1
< 0.1%

revol_util
Real number (ℝ)

Distinct1430
Distinct (%)0.1%
Missing1802
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean50.33769625
Minimum0
Maximum892.3
Zeros13069
Zeros (%)0.6%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:50.975093image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile9.4
Q131.5
median50.3
Q369.4
95-th percentile91
Maximum892.3
Range892.3
Interquartile range (IQR)37.9

Descriptive statistics

Standard deviation24.71307332
Coefficient of variation (CV)0.4909456563
Kurtosis-0.2226717499
Mean50.33769625
Median Absolute Deviation (MAD)18.9
Skewness0.01255594308
Sum113706110.6
Variance610.735993
MonotonicityNot monotonic
2023-04-16T23:54:51.069105image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 13069
 
0.6%
57 4324
 
0.2%
48 4283
 
0.2%
59 4272
 
0.2%
61 4223
 
0.2%
54 4190
 
0.2%
58 4188
 
0.2%
53 4185
 
0.2%
55 4181
 
0.2%
51 4175
 
0.2%
Other values (1420) 2207776
97.7%
ValueCountFrequency (%)
0 13069
0.6%
0.01 1
 
< 0.1%
0.03 1
 
< 0.1%
0.04 1
 
< 0.1%
0.05 1
 
< 0.1%
ValueCountFrequency (%)
892.3 1
< 0.1%
366.6 1
< 0.1%
193 1
< 0.1%
191 1
< 0.1%
184.6 1
< 0.1%

sec_app_chargeoff_within_12_mths
Real number (ℝ)

MISSING  SKEWED  ZEROS 

Distinct22
Distinct (%)< 0.1%
Missing2152647
Missing (%)95.2%
Infinite0
Infinite (%)0.0%
Mean0.0463520982
Minimum0
Maximum21
Zeros105117
Zeros (%)4.6%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:51.150989image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum21
Range21
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.4114960112
Coefficient of variation (CV)8.877613467
Kurtosis640.9638779
Mean0.0463520982
Median Absolute Deviation (MAD)0
Skewness20.27699345
Sum5007
Variance0.1693289673
MonotonicityNot monotonic
2023-04-16T23:54:51.213710image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%)
0 105117
 
4.6%
1 2073
 
0.1%
2 429
 
< 0.1%
3 146
 
< 0.1%
4 90
 
< 0.1%
5 57
 
< 0.1%
6 30
 
< 0.1%
7 20
 
< 0.1%
10 14
 
< 0.1%
8 12
 
< 0.1%
Other values (12) 33
 
< 0.1%
(Missing) 2152647
95.2%
ValueCountFrequency (%)
0 105117
4.6%
1 2073
 
0.1%
2 429
 
< 0.1%
3 146
 
< 0.1%
4 90
 
< 0.1%
ValueCountFrequency (%)
21 2
< 0.1%
20 2
< 0.1%
19 1
< 0.1%
18 2
< 0.1%
17 2
< 0.1%

sec_app_collections_12_mths_ex_med
Real number (ℝ)

MISSING  ZEROS 

Distinct18
Distinct (%)< 0.1%
Missing2152647
Missing (%)95.2%
Infinite0
Infinite (%)0.0%
Mean0.07756825062
Minimum0
Maximum23
Zeros101793
Zeros (%)4.5%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:51.275078image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum23
Range23
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.4079956247
Coefficient of variation (CV)5.25982759
Kurtosis362.2128631
Mean0.07756825062
Median Absolute Deviation (MAD)0
Skewness13.30092154
Sum8379
Variance0.1664604298
MonotonicityNot monotonic
2023-04-16T23:54:51.353636image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
0 101793
 
4.5%
1 5039
 
0.2%
2 799
 
< 0.1%
3 204
 
< 0.1%
4 79
 
< 0.1%
5 31
 
< 0.1%
6 25
 
< 0.1%
8 15
 
< 0.1%
10 11
 
< 0.1%
7 7
 
< 0.1%
Other values (8) 18
 
< 0.1%
(Missing) 2152647
95.2%
ValueCountFrequency (%)
0 101793
4.5%
1 5039
 
0.2%
2 799
 
< 0.1%
3 204
 
< 0.1%
4 79
 
< 0.1%
ValueCountFrequency (%)
23 1
< 0.1%
19 1
< 0.1%
18 1
< 0.1%
16 1
< 0.1%
15 2
< 0.1%

sec_app_earliest_cr_line
Categorical

HIGH CARDINALITY  IMBALANCE 

Distinct664
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.2 MiB
2152647 
Aug-2006
 
998
Aug-2005
 
894
Sep-2006
 
894
Sep-2005
 
889
Other values (659)
 
104346

Unique

Unique48 ?
Unique (%)< 0.1%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
2152647
95.2%
Aug-2006 998
 
< 0.1%
Aug-2005 894
 
< 0.1%
Sep-2006 894
 
< 0.1%
Sep-2005 889
 
< 0.1%
Sep-2004 848
 
< 0.1%
Aug-2004 822
 
< 0.1%
Oct-2005 816
 
< 0.1%
Aug-2007 804
 
< 0.1%
Jul-2006 768
 
< 0.1%
Other values (654) 100288
 
4.4%

sec_app_inq_last_6mths
Real number (ℝ)

MISSING  ZEROS 

Distinct7
Distinct (%)< 0.1%
Missing2152647
Missing (%)95.2%
Infinite0
Infinite (%)0.0%
Mean0.6332564964
Minimum0
Maximum6
Zeros65252
Zeros (%)2.9%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:51.416855image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile3
Maximum6
Range6
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.9934011556
Coefficient of variation (CV)1.56871846
Kurtosis5.007834225
Mean0.6332564964
Median Absolute Deviation (MAD)0
Skewness2.04610811
Sum68405
Variance0.986845856
MonotonicityNot monotonic
2023-04-16T23:54:51.479880image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
0 65252
 
2.9%
1 26964
 
1.2%
2 9698
 
0.4%
3 3643
 
0.2%
4 1509
 
0.1%
5 650
 
< 0.1%
6 305
 
< 0.1%
(Missing) 2152647
95.2%
ValueCountFrequency (%)
0 65252
2.9%
1 26964
1.2%
2 9698
 
0.4%
3 3643
 
0.2%
4 1509
 
0.1%
ValueCountFrequency (%)
6 305
 
< 0.1%
5 650
 
< 0.1%
4 1509
 
0.1%
3 3643
 
0.2%
2 9698
0.4%

sec_app_mort_acc
Real number (ℝ)

MISSING  ZEROS 

Distinct23
Distinct (%)< 0.1%
Missing2152647
Missing (%)95.2%
Infinite0
Infinite (%)0.0%
Mean1.538997047
Minimum0
Maximum27
Zeros42218
Zeros (%)1.9%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:51.558044image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q32
95-th percentile5
Maximum27
Range27
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.760568571
Coefficient of variation (CV)1.143971377
Kurtosis3.510018466
Mean1.538997047
Median Absolute Deviation (MAD)1
Skewness1.451643251
Sum166244
Variance3.099601694
MonotonicityNot monotonic
2023-04-16T23:54:51.643230image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=23)
ValueCountFrequency (%)
0 42218
 
1.9%
1 21072
 
0.9%
2 17790
 
0.8%
3 12141
 
0.5%
4 7379
 
0.3%
5 3926
 
0.2%
6 1939
 
0.1%
7 851
 
< 0.1%
8 373
 
< 0.1%
9 165
 
< 0.1%
Other values (13) 167
 
< 0.1%
(Missing) 2152647
95.2%
ValueCountFrequency (%)
0 42218
1.9%
1 21072
0.9%
2 17790
0.8%
3 12141
 
0.5%
4 7379
 
0.3%
ValueCountFrequency (%)
27 1
< 0.1%
23 1
< 0.1%
22 1
< 0.1%
20 1
< 0.1%
18 2
< 0.1%
Distinct140
Distinct (%)0.4%
Missing2224726
Missing (%)98.4%
Infinite0
Infinite (%)0.0%
Mean36.93792777
Minimum0
Maximum185
Zeros399
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:51.734446image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2
Q116
median36
Q356
95-th percentile76
Maximum185
Range185
Interquartile range (IQR)40

Descriptive statistics

Standard deviation23.92458363
Coefficient of variation (CV)0.6476969629
Kurtosis-0.7008442253
Mean36.93792777
Median Absolute Deviation (MAD)20
Skewness0.28522913
Sum1327623
Variance572.385702
MonotonicityNot monotonic
2023-04-16T23:54:51.824378image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1104
 
< 0.1%
2 633
 
< 0.1%
8 575
 
< 0.1%
5 552
 
< 0.1%
13 523
 
< 0.1%
9 522
 
< 0.1%
15 509
 
< 0.1%
4 509
 
< 0.1%
43 507
 
< 0.1%
14 504
 
< 0.1%
Other values (130) 30004
 
1.3%
(Missing) 2224726
98.4%
ValueCountFrequency (%)
0 399
 
< 0.1%
1 1104
< 0.1%
2 633
< 0.1%
3 484
< 0.1%
4 509
< 0.1%
ValueCountFrequency (%)
185 1
< 0.1%
159 1
< 0.1%
153 1
< 0.1%
147 1
< 0.1%
143 1
< 0.1%

sec_app_num_rev_accts
Real number (ℝ)

Distinct86
Distinct (%)0.1%
Missing2152647
Missing (%)95.2%
Infinite0
Infinite (%)0.0%
Mean12.53307227
Minimum0
Maximum106
Zeros580
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:51.928421image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile3
Q17
median11
Q317
95-th percentile28
Maximum106
Range106
Interquartile range (IQR)10

Descriptive statistics

Standard deviation8.150963551
Coefficient of variation (CV)0.650356383
Kurtosis3.823558297
Mean12.53307227
Median Absolute Deviation (MAD)5
Skewness1.429661781
Sum1353835
Variance66.43820681
MonotonicityNot monotonic
2023-04-16T23:54:52.013468image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
8 6639
 
0.3%
9 6543
 
0.3%
7 6473
 
0.3%
10 6282
 
0.3%
6 6221
 
0.3%
11 5880
 
0.3%
5 5619
 
0.2%
12 5569
 
0.2%
13 5178
 
0.2%
4 4922
 
0.2%
Other values (76) 48695
 
2.2%
(Missing) 2152647
95.2%
ValueCountFrequency (%)
0 580
 
< 0.1%
1 1741
 
0.1%
2 2730
0.1%
3 3822
0.2%
4 4922
0.2%
ValueCountFrequency (%)
106 1
< 0.1%
96 1
< 0.1%
95 1
< 0.1%
92 1
< 0.1%
90 1
< 0.1%

sec_app_open_acc
Real number (ℝ)

Distinct67
Distinct (%)0.1%
Missing2152647
Missing (%)95.2%
Infinite0
Infinite (%)0.0%
Mean11.46945501
Minimum0
Maximum82
Zeros95
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:52.108596image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile3
Q17
median10
Q315
95-th percentile24
Maximum82
Range82
Interquartile range (IQR)8

Descriptive statistics

Standard deviation6.627271136
Coefficient of variation (CV)0.5778191839
Kurtosis2.577122093
Mean11.46945501
Median Absolute Deviation (MAD)4
Skewness1.187173558
Sum1238942
Variance43.92072271
MonotonicityNot monotonic
2023-04-16T23:54:52.203845image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9 7711
 
0.3%
8 7690
 
0.3%
7 7336
 
0.3%
10 7297
 
0.3%
11 6786
 
0.3%
6 6640
 
0.3%
12 6331
 
0.3%
5 5889
 
0.3%
13 5623
 
0.2%
14 5030
 
0.2%
Other values (57) 41688
 
1.8%
(Missing) 2152647
95.2%
ValueCountFrequency (%)
0 95
 
< 0.1%
1 1499
 
0.1%
2 2553
0.1%
3 3740
0.2%
4 4812
0.2%
ValueCountFrequency (%)
82 1
< 0.1%
73 1
< 0.1%
67 1
< 0.1%
66 1
< 0.1%
65 1
< 0.1%

sec_app_open_act_il
Real number (ℝ)

Distinct40
Distinct (%)< 0.1%
Missing2152647
Missing (%)95.2%
Infinite0
Infinite (%)0.0%
Mean3.010553503
Minimum0
Maximum43
Zeros14376
Zeros (%)0.6%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:52.282581image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median2
Q34
95-th percentile9
Maximum43
Range43
Interquartile range (IQR)3

Descriptive statistics

Standard deviation3.275893064
Coefficient of variation (CV)1.088136471
Kurtosis11.81614557
Mean3.010553503
Median Absolute Deviation (MAD)1
Skewness2.800302479
Sum325203
Variance10.73147537
MonotonicityNot monotonic
2023-04-16T23:54:52.376733image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=40)
ValueCountFrequency (%)
1 24089
 
1.1%
2 23099
 
1.0%
3 16430
 
0.7%
0 14376
 
0.6%
4 9942
 
0.4%
5 5905
 
0.3%
6 3593
 
0.2%
7 2361
 
0.1%
8 1684
 
0.1%
9 1307
 
0.1%
Other values (30) 5235
 
0.2%
(Missing) 2152647
95.2%
ValueCountFrequency (%)
0 14376
0.6%
1 24089
1.1%
2 23099
1.0%
3 16430
0.7%
4 9942
0.4%
ValueCountFrequency (%)
43 1
 
< 0.1%
39 4
< 0.1%
38 1
 
< 0.1%
36 1
 
< 0.1%
35 1
 
< 0.1%

sec_app_revol_util
Real number (ℝ)

Distinct1216
Distinct (%)1.1%
Missing2154484
Missing (%)95.3%
Infinite0
Infinite (%)0.0%
Mean58.16910081
Minimum0
Maximum434.3
Zeros1182
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:52.455296image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile12.4
Q139.8
median60.2
Q378.6
95-th percentile95.9
Maximum434.3
Range434.3
Interquartile range (IQR)38.8

Descriptive statistics

Standard deviation25.54821161
Coefficient of variation (CV)0.4392058887
Kurtosis-0.1960062169
Mean58.16910081
Median Absolute Deviation (MAD)19.3
Skewness-0.2564452211
Sum6176627.8
Variance652.7111167
MonotonicityNot monotonic
2023-04-16T23:54:52.549598image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1182
 
0.1%
59.4 180
 
< 0.1%
62.4 179
 
< 0.1%
67.2 174
 
< 0.1%
70.1 173
 
< 0.1%
77.9 172
 
< 0.1%
70.7 170
 
< 0.1%
73.5 170
 
< 0.1%
68.8 170
 
< 0.1%
69.8 168
 
< 0.1%
Other values (1206) 103446
 
4.6%
(Missing) 2154484
95.3%
ValueCountFrequency (%)
0 1182
0.1%
0.1 58
 
< 0.1%
0.2 36
 
< 0.1%
0.3 42
 
< 0.1%
0.4 33
 
< 0.1%
ValueCountFrequency (%)
434.3 1
< 0.1%
235.3 1
< 0.1%
212.6 1
< 0.1%
191 1
< 0.1%
184.2 1
< 0.1%

settlement_amount
Real number (ℝ)

Distinct21519
Distinct (%)65.1%
Missing2227612
Missing (%)98.5%
Infinite0
Infinite (%)0.0%
Mean5030.606922
Minimum44.21
Maximum33601
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:52.645683image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum44.21
5-th percentile794
Q12227
median4172.855
Q36870.7825
95-th percentile12375.5875
Maximum33601
Range33556.79
Interquartile range (IQR)4643.7825

Descriptive statistics

Standard deviation3692.027842
Coefficient of variation (CV)0.7339130047
Kurtosis2.172350055
Mean5030.606922
Median Absolute Deviation (MAD)2205.035
Skewness1.307812535
Sum166291742.4
Variance13631069.58
MonotonicityNot monotonic
2023-04-16T23:54:52.754233image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5000 63
 
< 0.1%
4000 51
 
< 0.1%
3000 49
 
< 0.1%
6000 49
 
< 0.1%
8000 44
 
< 0.1%
10000 44
 
< 0.1%
7000 41
 
< 0.1%
3500 39
 
< 0.1%
6500 35
 
< 0.1%
7500 33
 
< 0.1%
Other values (21509) 32608
 
1.4%
(Missing) 2227612
98.5%
ValueCountFrequency (%)
44.21 1
< 0.1%
60.84 1
< 0.1%
82.96 1
< 0.1%
107 1
< 0.1%
120 1
< 0.1%
ValueCountFrequency (%)
33601 1
< 0.1%
30000 1
< 0.1%
28503 1
< 0.1%
28000 1
< 0.1%
27850 1
< 0.1%

settlement_date
Categorical

HIGH CARDINALITY  IMBALANCE  MISSING 

Distinct90
Distinct (%)< 0.1%
Missing92352
Missing (%)4.1%
Memory size17.2 MiB
2135260 
Jan-2019
 
1725
Oct-2018
 
1532
Mar-2018
 
1407
Sep-2018
 
1406
Other values (85)
 
26986

Unique

Unique7 ?
Unique (%)< 0.1%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
2135260
94.5%
Jan-2019 1725
 
0.1%
Oct-2018 1532
 
0.1%
Mar-2018 1407
 
0.1%
Sep-2018 1406
 
0.1%
Nov-2018 1400
 
0.1%
Jun-2018 1396
 
0.1%
Jan-2018 1391
 
0.1%
Aug-2018 1388
 
0.1%
May-2018 1361
 
0.1%
Other values (80) 20050
 
0.9%
(Missing) 92352
 
4.1%

settlement_percentage
Real number (ℝ)

Distinct2045
Distinct (%)6.2%
Missing2227612
Missing (%)98.5%
Infinite0
Infinite (%)0.0%
Mean47.77559959
Minimum0.2
Maximum521.35
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:52.848525image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0.2
5-th percentile40
Q145
median45
Q350
95-th percentile60.01
Maximum521.35
Range521.15
Interquartile range (IQR)5

Descriptive statistics

Standard deviation7.336378604
Coefficient of variation (CV)0.1535591111
Kurtosis534.1818854
Mean47.77559959
Median Absolute Deviation (MAD)4.835
Skewness9.32180216
Sum1579270.22
Variance53.82245102
MonotonicityNot monotonic
2023-04-16T23:54:52.944794image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
45 11612
 
0.5%
50 5539
 
0.2%
40 2131
 
0.1%
45.01 1567
 
0.1%
60 1230
 
0.1%
55 1089
 
< 0.1%
50.01 988
 
< 0.1%
44.99 821
 
< 0.1%
65 678
 
< 0.1%
49.99 457
 
< 0.1%
Other values (2035) 6944
 
0.3%
(Missing) 2227612
98.5%
ValueCountFrequency (%)
0.2 1
< 0.1%
0.45 1
< 0.1%
0.55 1
< 0.1%
0.65 1
< 0.1%
10.69 1
< 0.1%
ValueCountFrequency (%)
521.35 1
 
< 0.1%
184.36 1
 
< 0.1%
166.67 1
 
< 0.1%
100 3
< 0.1%
98.57 1
 
< 0.1%

settlement_status
Categorical

IMBALANCE  MISSING 

Distinct4
Distinct (%)< 0.1%
Missing92352
Missing (%)4.1%
Memory size17.2 MiB
2135260 
ACTIVE
 
14811
COMPLETE
 
13517
BROKEN
 
4728

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
2135260
94.5%
ACTIVE 14811
 
0.7%
COMPLETE 13517
 
0.6%
BROKEN 4728
 
0.2%
(Missing) 92352
 
4.1%

Common Values (Plot)

2023-04-16T23:54:53.044653image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

settlement_term
Real number (ℝ)

Distinct40
Distinct (%)0.1%
Missing2227612
Missing (%)98.5%
Infinite0
Infinite (%)0.0%
Mean13.14859632
Minimum0
Maximum181
Zeros2729
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:53.119229image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q16
median14
Q318
95-th percentile24
Maximum181
Range181
Interquartile range (IQR)12

Descriptive statistics

Standard deviation8.192318686
Coefficient of variation (CV)0.6230565215
Kurtosis5.776580814
Mean13.14859632
Median Absolute Deviation (MAD)6
Skewness0.1863272482
Sum434640
Variance67.11408546
MonotonicityNot monotonic
2023-04-16T23:54:53.197463image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=40)
ValueCountFrequency (%)
18 6608
 
0.3%
24 6194
 
0.3%
12 4662
 
0.2%
1 3021
 
0.1%
0 2729
 
0.1%
6 1907
 
0.1%
16 1394
 
0.1%
10 1166
 
0.1%
8 1067
 
< 0.1%
14 931
 
< 0.1%
Other values (30) 3377
 
0.1%
(Missing) 2227612
98.5%
ValueCountFrequency (%)
0 2729
0.1%
1 3021
0.1%
2 270
 
< 0.1%
3 237
 
< 0.1%
4 514
 
< 0.1%
ValueCountFrequency (%)
181 1
 
< 0.1%
118 1
 
< 0.1%
112 1
 
< 0.1%
65 3
< 0.1%
60 1
 
< 0.1%

sub_grade
Categorical

Distinct35
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.2 MiB
C1
 
145903
B5
 
140288
B4
 
139793
B3
 
131514
C2
 
131116
Other values (30)
1572054 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowC1
2nd rowD2
3rd rowD1
4th rowD2
5th rowC4

Common Values

ValueCountFrequency (%)
C1 145903
 
6.5%
B5 140288
 
6.2%
B4 139793
 
6.2%
B3 131514
 
5.8%
C2 131116
 
5.8%
C3 129193
 
5.7%
C4 127115
 
5.6%
B2 126621
 
5.6%
B1 125341
 
5.5%
C5 116726
 
5.2%
Other values (25) 947058
41.9%

tax_liens
Real number (ℝ)

SKEWED  ZEROS 

Distinct42
Distinct (%)< 0.1%
Missing105
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean0.04677109198
Minimum0
Maximum85
Zeros2195933
Zeros (%)97.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:53.292782image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum85
Range85
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.377533821
Coefficient of variation (CV)8.071947971
Kurtosis3476.326736
Mean0.04677109198
Median Absolute Deviation (MAD)0
Skewness32.07091145
Sum105729
Variance0.142531786
MonotonicityNot monotonic
2023-04-16T23:54:53.371272image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=42)
ValueCountFrequency (%)
0 2195933
97.1%
1 43638
 
1.9%
2 12172
 
0.5%
3 4456
 
0.2%
4 2007
 
0.1%
5 1026
 
< 0.1%
6 557
 
< 0.1%
7 265
 
< 0.1%
8 160
 
< 0.1%
9 103
 
< 0.1%
Other values (32) 246
 
< 0.1%
(Missing) 105
 
< 0.1%
ValueCountFrequency (%)
0 2195933
97.1%
1 43638
 
1.9%
2 12172
 
0.5%
3 4456
 
0.2%
4 2007
 
0.1%
ValueCountFrequency (%)
85 1
< 0.1%
63 1
< 0.1%
61 2
< 0.1%
53 1
< 0.1%
52 1
< 0.1%

term
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.2 MiB
36 months
1609754 
60 months
650914 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row36 months
2nd row60 months
3rd row36 months
4th row36 months
5th row60 months

Common Values

ValueCountFrequency (%)
36 months 1609754
71.2%
60 months 650914
28.8%

Common Values (Plot)

2023-04-16T23:54:53.449818image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

title
Categorical

HIGH CARDINALITY  IMBALANCE 

Distinct61465
Distinct (%)2.7%
Missing1
Missing (%)< 0.1%
Memory size17.2 MiB
Debt consolidation
1153644 
Credit card refinancing
469701 
Home improvement
137495 
Other
127714 
Major purchase
 
44840
Other values (61460)
327273 

Unique

Unique52514 ?
Unique (%)2.3%

Sample

1st rowDebt consolidation
2nd rowDebt consolidation
3rd rowDebt consolidation
4th rowDebt consolidation
5th rowDebt consolidation

Common Values

ValueCountFrequency (%)
Debt consolidation 1153644
51.0%
Credit card refinancing 469701
20.8%
Home improvement 137495
 
6.1%
Other 127714
 
5.6%
Major purchase 44840
 
2.0%
Medical expenses 25389
 
1.1%
23322
 
1.0%
Business 20822
 
0.9%
Car financing 20526
 
0.9%
Debt Consolidation 16417
 
0.7%
Other values (61455) 220797
 
9.8%

tot_coll_amt
Real number (ℝ)

MISSING  SKEWED  ZEROS 

Distinct15574
Distinct (%)0.7%
Missing70276
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean232.7317389
Minimum0
Maximum9152545
Zeros1856129
Zeros (%)82.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:53.529158image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile786
Maximum9152545
Range9152545
Interquartile range (IQR)0

Descriptive statistics

Standard deviation8518.461819
Coefficient of variation (CV)36.60206322
Kurtosis803765.2461
Mean232.7317389
Median Absolute Deviation (MAD)0
Skewness852.0101323
Sum509773739
Variance72564191.77
MonotonicityNot monotonic
2023-04-16T23:54:53.637208image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1856129
82.1%
50 3924
 
0.2%
100 3250
 
0.1%
75 2489
 
0.1%
200 1911
 
0.1%
150 1904
 
0.1%
60 1791
 
0.1%
70 1457
 
0.1%
80 1411
 
0.1%
55 1247
 
0.1%
Other values (15564) 314879
 
13.9%
(Missing) 70276
 
3.1%
ValueCountFrequency (%)
0 1856129
82.1%
2 5
 
< 0.1%
3 2
 
< 0.1%
4 1
 
< 0.1%
5 2
 
< 0.1%
ValueCountFrequency (%)
9152545 1
< 0.1%
6214661 1
< 0.1%
5252395 1
< 0.1%
932461 1
< 0.1%
848438 1
< 0.1%

tot_cur_bal
Real number (ℝ)

Distinct487688
Distinct (%)22.3%
Missing70276
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean142492.1952
Minimum0
Maximum9971659
Zeros959
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:53.733721image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile8195.55
Q129092
median79240
Q3213204
95-th percentile441856.45
Maximum9971659
Range9971659
Interquartile range (IQR)184112

Descriptive statistics

Standard deviation160692.6406
Coefficient of variation (CV)1.12772942
Kurtosis33.33453308
Mean142492.1952
Median Absolute Deviation (MAD)63035
Skewness2.974725283
Sum3.121137644 × 1011
Variance2.582212475 × 1010
MonotonicityNot monotonic
2023-04-16T23:54:53.830509image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 959
 
< 0.1%
14186 41
 
< 0.1%
23772 39
 
< 0.1%
20275 39
 
< 0.1%
25197 38
 
< 0.1%
22831 38
 
< 0.1%
19923 38
 
< 0.1%
23442 38
 
< 0.1%
23607 38
 
< 0.1%
20317 38
 
< 0.1%
Other values (487678) 2189086
96.8%
(Missing) 70276
 
3.1%
ValueCountFrequency (%)
0 959
< 0.1%
1 13
 
< 0.1%
2 15
 
< 0.1%
3 19
 
< 0.1%
4 12
 
< 0.1%
ValueCountFrequency (%)
9971659 1
< 0.1%
8524709 1
< 0.1%
8000078 1
< 0.1%
5752177 1
< 0.1%
5445012 1
< 0.1%

tot_hi_cred_lim
Real number (ℝ)

Distinct529972
Distinct (%)24.2%
Missing70276
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean178242.7537
Minimum0
Maximum9999999
Zeros70
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:53.934752image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile18830
Q150731
median114298.5
Q3257755
95-th percentile510883.8
Maximum9999999
Range9999999
Interquartile range (IQR)207024

Descriptive statistics

Standard deviation181574.8147
Coefficient of variation (CV)1.018693949
Kurtosis84.33085943
Mean178242.7537
Median Absolute Deviation (MAD)79017.5
Skewness3.829997886
Sum3.904215019 × 1011
Variance3.296941332 × 1010
MonotonicityNot monotonic
2023-04-16T23:54:54.034230image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
12500 716
 
< 0.1%
15000 712
 
< 0.1%
15500 697
 
< 0.1%
19000 696
 
< 0.1%
16500 690
 
< 0.1%
16000 690
 
< 0.1%
13500 690
 
< 0.1%
11000 686
 
< 0.1%
13000 684
 
< 0.1%
17500 684
 
< 0.1%
Other values (529962) 2183447
96.6%
(Missing) 70276
 
3.1%
ValueCountFrequency (%)
0 70
< 0.1%
100 1
 
< 0.1%
119 1
 
< 0.1%
154 1
 
< 0.1%
200 16
 
< 0.1%
ValueCountFrequency (%)
9999999 14
< 0.1%
9792792 1
 
< 0.1%
9375662 1
 
< 0.1%
8700253 1
 
< 0.1%
8592561 1
 
< 0.1%

total_acc
Real number (ℝ)

Distinct152
Distinct (%)< 0.1%
Missing29
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean24.16255227
Minimum1
Maximum176
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:54.130509image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile8
Q115
median22
Q331
95-th percentile46
Maximum176
Range175
Interquartile range (IQR)16

Descriptive statistics

Standard deviation11.98752832
Coefficient of variation (CV)0.4961201194
Kurtosis1.848815197
Mean24.16255227
Median Absolute Deviation (MAD)8
Skewness1.007455501
Sum54622808
Variance143.7008352
MonotonicityNot monotonic
2023-04-16T23:54:54.226153image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20 82570
 
3.7%
19 82012
 
3.6%
18 81931
 
3.6%
17 81378
 
3.6%
21 81170
 
3.6%
16 79655
 
3.5%
22 79438
 
3.5%
23 77691
 
3.4%
15 77146
 
3.4%
24 75330
 
3.3%
Other values (142) 1462318
64.7%
ValueCountFrequency (%)
1 21
 
< 0.1%
2 1333
 
0.1%
3 4244
 
0.2%
4 10456
0.5%
5 16398
0.7%
ValueCountFrequency (%)
176 1
< 0.1%
173 1
< 0.1%
169 1
< 0.1%
165 1
< 0.1%
162 1
< 0.1%

total_bal_ex_mort
Real number (ℝ)

Distinct212777
Distinct (%)9.6%
Missing50030
Missing (%)2.2%
Infinite0
Infinite (%)0.0%
Mean51022.93846
Minimum0
Maximum3408095
Zeros1584
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:54.321082image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile6556.85
Q120892
median37864
Q364350
95-th percentile138768.15
Maximum3408095
Range3408095
Interquartile range (IQR)43458

Descriptive statistics

Standard deviation49911.23567
Coefficient of variation (CV)0.9782117058
Kurtosis65.58390119
Mean51022.93846
Median Absolute Deviation (MAD)19944
Skewness4.236560408
Sum1.127932466 × 1011
Variance2491131446
MonotonicityNot monotonic
2023-04-16T23:54:54.413865image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1584
 
0.1%
23068 59
 
< 0.1%
24214 57
 
< 0.1%
20275 57
 
< 0.1%
20346 56
 
< 0.1%
19095 56
 
< 0.1%
25529 55
 
< 0.1%
22831 55
 
< 0.1%
20317 55
 
< 0.1%
19217 54
 
< 0.1%
Other values (212767) 2208550
97.7%
(Missing) 50030
 
2.2%
ValueCountFrequency (%)
0 1584
0.1%
1 23
 
< 0.1%
2 23
 
< 0.1%
3 27
 
< 0.1%
4 14
 
< 0.1%
ValueCountFrequency (%)
3408095 1
< 0.1%
2921551 1
< 0.1%
2698600 1
< 0.1%
2688920 1
< 0.1%
2652799 1
< 0.1%

total_bal_il
Real number (ℝ)

MISSING  ZEROS 

Distinct162249
Distinct (%)11.6%
Missing866129
Missing (%)38.3%
Infinite0
Infinite (%)0.0%
Mean35506.64527
Minimum0
Maximum1837038
Zeros158666
Zeros (%)7.0%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:54.508831image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q18695
median23127
Q346095
95-th percentile113501.1
Maximum1837038
Range1837038
Interquartile range (IQR)37400

Descriptive statistics

Standard deviation44097.45592
Coefficient of variation (CV)1.241949376
Kurtosis32.73888123
Mean35506.64527
Median Absolute Deviation (MAD)17047
Skewness3.821058228
Sum4.951540158 × 1010
Variance1944585619
MonotonicityNot monotonic
2023-04-16T23:54:54.604023image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 158666
 
7.0%
10000 103
 
< 0.1%
15000 95
 
< 0.1%
5000 86
 
< 0.1%
5500 77
 
< 0.1%
9500 66
 
< 0.1%
2000 65
 
< 0.1%
6000 64
 
< 0.1%
20000 64
 
< 0.1%
7000 64
 
< 0.1%
Other values (162239) 1235189
54.6%
(Missing) 866129
38.3%
ValueCountFrequency (%)
0 158666
7.0%
1 58
 
< 0.1%
2 12
 
< 0.1%
3 12
 
< 0.1%
4 9
 
< 0.1%
ValueCountFrequency (%)
1837038 1
< 0.1%
1754743 1
< 0.1%
1711009 1
< 0.1%
1547285 1
< 0.1%
1466398 1
< 0.1%

total_bc_limit
Real number (ℝ)

MISSING  ZEROS 

Distinct20309
Distinct (%)0.9%
Missing50030
Missing (%)2.2%
Infinite0
Infinite (%)0.0%
Mean23193.76817
Minimum0
Maximum1569000
Zeros25349
Zeros (%)1.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:54.714421image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2300
Q18300
median16300
Q330300
95-th percentile67300
Maximum1569000
Range1569000
Interquartile range (IQR)22000

Descriptive statistics

Standard deviation23006.55824
Coefficient of variation (CV)0.9919284381
Kurtosis29.91804426
Mean23193.76817
Median Absolute Deviation (MAD)9600
Skewness2.990081245
Sum5.127302529 × 1010
Variance529301722
MonotonicityNot monotonic
2023-04-16T23:54:54.810757image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 25349
 
1.1%
5000 16941
 
0.7%
6000 15110
 
0.7%
10000 14744
 
0.7%
7000 14383
 
0.6%
8000 14055
 
0.6%
4000 13985
 
0.6%
3000 13939
 
0.6%
7500 13427
 
0.6%
9000 13175
 
0.6%
Other values (20299) 2055530
90.9%
(Missing) 50030
 
2.2%
ValueCountFrequency (%)
0 25349
1.1%
100 17
 
< 0.1%
200 260
 
< 0.1%
250 4
 
< 0.1%
251 1
 
< 0.1%
ValueCountFrequency (%)
1569000 1
< 0.1%
1105500 1
< 0.1%
1090700 1
< 0.1%
834300 1
< 0.1%
760000 1
< 0.1%

total_cu_tl
Real number (ℝ)

MISSING  ZEROS 

Distinct62
Distinct (%)< 0.1%
Missing866130
Missing (%)38.3%
Infinite0
Infinite (%)0.0%
Mean1.477304312
Minimum0
Maximum111
Zeros753128
Zeros (%)33.3%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:54.921286image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q32
95-th percentile7
Maximum111
Range111
Interquartile range (IQR)2

Descriptive statistics

Standard deviation2.672991191
Coefficient of variation (CV)1.809370737
Kurtosis24.10976388
Mean1.477304312
Median Absolute Deviation (MAD)0
Skewness3.575038998
Sum2060157
Variance7.144881908
MonotonicityNot monotonic
2023-04-16T23:54:55.502604image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 753128
33.3%
1 235619
 
10.4%
2 129673
 
5.7%
3 82042
 
3.6%
4 55090
 
2.4%
5 38010
 
1.7%
6 27101
 
1.2%
7 19362
 
0.9%
8 13953
 
0.6%
9 10143
 
0.4%
Other values (52) 30417
 
1.3%
(Missing) 866130
38.3%
ValueCountFrequency (%)
0 753128
33.3%
1 235619
 
10.4%
2 129673
 
5.7%
3 82042
 
3.6%
4 55090
 
2.4%
ValueCountFrequency (%)
111 1
 
< 0.1%
79 1
 
< 0.1%
77 1
 
< 0.1%
71 1
 
< 0.1%
68 3
< 0.1%

total_il_high_credit_limit
Real number (ℝ)

MISSING  ZEROS 

Distinct194137
Distinct (%)8.9%
Missing70276
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean43732.01348
Minimum0
Maximum2118996
Zeros263497
Zeros (%)11.7%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:55.597579image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q115000
median32696
Q358804.25
95-th percentile125348.45
Maximum2118996
Range2118996
Interquartile range (IQR)43804.25

Descriptive statistics

Standard deviation45072.98219
Coefficient of variation (CV)1.03066332
Kurtosis28.28428368
Mean43732.01348
Median Absolute Deviation (MAD)20696
Skewness3.101032113
Sum9.579025246 × 1010
Variance2031573724
MonotonicityNot monotonic
2023-04-16T23:54:55.693949image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 263497
 
11.7%
10000 13338
 
0.6%
15000 10273
 
0.5%
20000 8412
 
0.4%
5000 8160
 
0.4%
12000 6496
 
0.3%
25000 6200
 
0.3%
6000 5494
 
0.2%
8000 5136
 
0.2%
7000 3481
 
0.2%
Other values (194127) 1859905
82.3%
(Missing) 70276
 
3.1%
ValueCountFrequency (%)
0 263497
11.7%
36 1
 
< 0.1%
44 1
 
< 0.1%
59 1
 
< 0.1%
75 1
 
< 0.1%
ValueCountFrequency (%)
2118996 1
< 0.1%
2101913 1
< 0.1%
2000000 1
< 0.1%
1840000 1
< 0.1%
1736064 1
< 0.1%

total_pymnt
Real number (ℝ)

Distinct1608439
Distinct (%)71.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11824.0305
Minimum0
Maximum63296.87787
Zeros1008
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:55.787424image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1154.4835
Q14272.58
median9060.869904
Q316707.9727
95-th percentile32490.72685
Maximum63296.87787
Range63296.87787
Interquartile range (IQR)12435.3927

Descriptive statistics

Standard deviation9889.599027
Coefficient of variation (CV)0.8363983016
Kurtosis1.377826568
Mean11824.0305
Median Absolute Deviation (MAD)5618.329904
Skewness1.27502792
Sum2.673020738 × 1010
Variance97804168.92
MonotonicityNot monotonic
2023-04-16T23:54:55.882518image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1008
 
< 0.1%
1215.49 195
 
< 0.1%
1257.3 176
 
< 0.1%
11258.43637 167
 
< 0.1%
1246.16 158
 
< 0.1%
10838.35484 154
 
< 0.1%
1234.95 153
 
< 0.1%
1824.93 149
 
< 0.1%
13510.12859 148
 
< 0.1%
4861.94 146
 
< 0.1%
Other values (1608429) 2258214
99.9%
ValueCountFrequency (%)
0 1008
< 0.1%
0.75 1
 
< 0.1%
0.8 1
 
< 0.1%
10 1
 
< 0.1%
16.58 1
 
< 0.1%
ValueCountFrequency (%)
63296.87787 1
< 0.1%
62948.99096 1
< 0.1%
62884.79738 1
< 0.1%
62862.50673 1
< 0.1%
62837.63969 1
< 0.1%

total_pymnt_inv
Real number (ℝ)

Distinct1299089
Distinct (%)57.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11805.94352
Minimum0
Maximum63296.88
Zeros1286
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:55.978300image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1145.86
Q14257.73
median9043.08
Q316682.5725
95-th percentile32469.93
Maximum63296.88
Range63296.88
Interquartile range (IQR)12424.8425

Descriptive statistics

Standard deviation9884.834668
Coefficient of variation (CV)0.8372761275
Kurtosis1.379693626
Mean11805.94352
Median Absolute Deviation (MAD)5612.04
Skewness1.275889736
Sum2.668931871 × 1010
Variance97709956.42
MonotonicityNot monotonic
2023-04-16T23:54:56.071981image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1286
 
0.1%
11431.12 266
 
< 0.1%
11784.23 265
 
< 0.1%
11258.44 234
 
< 0.1%
11471.31 218
 
< 0.1%
10956.78 214
 
< 0.1%
12128.02 211
 
< 0.1%
11955.4 205
 
< 0.1%
13510.13 194
 
< 0.1%
1215.49 192
 
< 0.1%
Other values (1299079) 2257383
99.9%
ValueCountFrequency (%)
0 1286
0.1%
0.51 1
 
< 0.1%
0.54 1
 
< 0.1%
0.75 1
 
< 0.1%
0.8 1
 
< 0.1%
ValueCountFrequency (%)
63296.88 1
< 0.1%
62904.03 1
< 0.1%
62862.51 1
< 0.1%
62839.88 1
< 0.1%
62837.64 1
< 0.1%

total_rec_int
Real number (ℝ)

Distinct629835
Distinct (%)27.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2386.351954
Minimum0
Maximum28192.5
Zeros2657
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:56.169273image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile183.51
Q1693.61
median1485.28
Q33052.22
95-th percentile7740.91
Maximum28192.5
Range28192.5
Interquartile range (IQR)2358.61

Descriptive statistics

Standard deviation2663.086087
Coefficient of variation (CV)1.115965347
Kurtosis9.257457118
Mean2386.351954
Median Absolute Deviation (MAD)969.99
Skewness2.565760023
Sum5394749498
Variance7092027.505
MonotonicityNot monotonic
2023-04-16T23:54:56.249141image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 2657
 
0.1%
1431.12 316
 
< 0.1%
1784.23 310
 
< 0.1%
1955.4 246
 
< 0.1%
956.78 245
 
< 0.1%
1258.44 244
 
< 0.1%
1471.31 239
 
< 0.1%
1977.77 229
 
< 0.1%
2128.02 228
 
< 0.1%
2862.28 224
 
< 0.1%
Other values (629825) 2255730
99.8%
ValueCountFrequency (%)
0 2657
0.1%
0.01 28
 
< 0.1%
0.06 1
 
< 0.1%
0.07 2
 
< 0.1%
0.12 1
 
< 0.1%
ValueCountFrequency (%)
28192.5 1
< 0.1%
27948.99 1
< 0.1%
27884.8 1
< 0.1%
27862.51 1
< 0.1%
27837.64 1
< 0.1%

total_rec_late_fee
Real number (ℝ)

SKEWED  ZEROS 

Distinct17991
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.462468888
Minimum-9.5 × 10-9
Maximum1427.25
Zeros2176107
Zeros (%)96.3%
Negative8
Negative (%)< 0.1%
Memory size17.2 MiB
2023-04-16T23:54:56.349030image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum-9.5 × 10-9
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum1427.25
Range1427.25
Interquartile range (IQR)0

Descriptive statistics

Standard deviation11.50209505
Coefficient of variation (CV)7.864847685
Kurtosis953.9855236
Mean1.462468888
Median Absolute Deviation (MAD)0
Skewness21.84586707
Sum3306156.615
Variance132.2981904
MonotonicityNot monotonic
2023-04-16T23:54:56.435905image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 2176107
96.3%
15 17687
 
0.8%
30 3656
 
0.2%
45 1223
 
0.1%
60 508
 
< 0.1%
75 250
 
< 0.1%
16.61 129
 
< 0.1%
90 126
 
< 0.1%
16.37 115
 
< 0.1%
15.94 111
 
< 0.1%
Other values (17981) 60756
 
2.7%
ValueCountFrequency (%)
-9.5 × 10-91
< 0.1%
-5.1 × 10-91
< 0.1%
-3.9 × 10-91
< 0.1%
-2 × 10-91
< 0.1%
-1.8 × 10-91
< 0.1%
ValueCountFrequency (%)
1427.25 1
< 0.1%
1188.83 1
< 0.1%
1098.360001 1
< 0.1%
955.92 1
< 0.1%
936.6 1
< 0.1%

total_rec_prncp
Real number (ℝ)

Distinct487427
Distinct (%)21.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9300.142079
Minimum0
Maximum40000
Zeros2575
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:56.530817image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile699.24
Q12846.18
median6823.385
Q313397.5
95-th percentile27000
Maximum40000
Range40000
Interquartile range (IQR)10551.32

Descriptive statistics

Standard deviation8304.885568
Coefficient of variation (CV)0.8929848057
Kurtosis1.156052338
Mean9300.142079
Median Absolute Deviation (MAD)4676.615
Skewness1.268459094
Sum2.102453359 × 1010
Variance68971124.3
MonotonicityNot monotonic
2023-04-16T23:54:56.645070image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10000 78590
 
3.5%
12000 57687
 
2.6%
15000 54801
 
2.4%
20000 53752
 
2.4%
5000 39524
 
1.7%
8000 38557
 
1.7%
35000 37475
 
1.7%
6000 37150
 
1.6%
16000 27869
 
1.2%
25000 26032
 
1.2%
Other values (487417) 1809231
80.0%
ValueCountFrequency (%)
0 2575
0.1%
0.01 1
 
< 0.1%
2.13 1
 
< 0.1%
5.03 1
 
< 0.1%
5.62 1
 
< 0.1%
ValueCountFrequency (%)
40000 4941
0.2%
39999.93 1
 
< 0.1%
39989.79 1
 
< 0.1%
39975 4
 
< 0.1%
39950.61 1
 
< 0.1%

total_rev_hi_lim
Real number (ℝ)

MISSING  SKEWED 

Distinct34220
Distinct (%)1.6%
Missing70276
Missing (%)3.1%
Infinite0
Infinite (%)0.0%
Mean34573.94277
Minimum0
Maximum9999999
Zeros1366
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size17.2 MiB
2023-04-16T23:54:56.750315image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile6000
Q114700
median25400
Q343200
95-th percentile91100
Maximum9999999
Range9999999
Interquartile range (IQR)28500

Descriptive statistics

Standard deviation36728.49545
Coefficient of variation (CV)1.06231724
Kurtosis7520.926598
Mean34573.94277
Median Absolute Deviation (MAD)12800
Skewness32.55742738
Sum7.573048765 × 1010
Variance1348982378
MonotonicityNot monotonic
2023-04-16T23:54:56.845577image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10000 7086
 
0.3%
15000 6840
 
0.3%
13000 6720
 
0.3%
12000 6657
 
0.3%
14000 6640
 
0.3%
11000 6624
 
0.3%
11500 6591
 
0.3%
16000 6535
 
0.3%
17000 6526
 
0.3%
12500 6496
 
0.3%
Other values (34210) 2123677
93.9%
(Missing) 70276
 
3.1%
ValueCountFrequency (%)
0 1366
0.1%
100 28
 
< 0.1%
200 71
 
< 0.1%
300 354
 
< 0.1%
400 113
 
< 0.1%
ValueCountFrequency (%)
9999999 3
< 0.1%
2175000 1
 
< 0.1%
2087500 1
 
< 0.1%
2059200 1
 
< 0.1%
2013133 1
 
< 0.1%

url
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2260668
Missing (%)100.0%
Memory size17.2 MiB
Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.2 MiB
Source Verified
886231 
Not Verified
744806 
Verified
629631 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNot Verified
2nd rowSource Verified
3rd rowSource Verified
4th rowSource Verified
5th rowNot Verified

Common Values

ValueCountFrequency (%)
Source Verified 886231
39.2%
Not Verified 744806
32.9%
Verified 629631
27.9%

Common Values (Plot)

2023-04-16T23:54:56.934447image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.2 MiB
2144938 
Not Verified
 
57403
Source Verified
 
34827
Verified
 
23500

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
2144938
94.9%
Not Verified 57403
 
2.5%
Source Verified 34827
 
1.5%
Verified 23500
 
1.0%

Common Values (Plot)

2023-04-16T23:54:57.000262image/svg+xmlMatplotlib v3.6.3, https://matplotlib.org/

zip_code
Categorical

Distinct957
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.2 MiB
112xx
 
23908
945xx
 
23782
750xx
 
23649
606xx
 
21192
300xx
 
20497
Other values (952)
2147640 

Unique

Unique35 ?
Unique (%)< 0.1%

Sample

1st row109xx
2nd row713xx
3rd row490xx
4th row985xx
5th row212xx

Common Values

ValueCountFrequency (%)
112xx 23908
 
1.1%
945xx 23782
 
1.1%
750xx 23649
 
1.0%
606xx 21192
 
0.9%
300xx 20497
 
0.9%
331xx 19051
 
0.8%
070xx 18316
 
0.8%
770xx 17719
 
0.8%
891xx 17162
 
0.8%
100xx 17103
 
0.8%
Other values (947) 2058289
91.0%